SRE/DevOps with Python

About

OVHcloud, a European leader in cloud computing, operates in some fifteen countries and provides secure, reliable, and accessible hosting and cloud solutions.

In a world where digital technology plays a predominant role and is constantly evolving, we believe the future lies in an open, reliable, and sustainable cloud that lets users freely choose how they manage their data.

We always put the collective first, which is why we work closely with and for our ecosystem, made up first and foremost of our employees, along with our customers, partners, and institutional stakeholders.

  • Founded in 1999

  • ∼3,000 employees in 14 countries

  • 46 data centers

Job Description

As a Public Cloud SRE focused on OpenStack, you will be a cornerstone of the reliability, performance, and scalability of our cloud infrastructure. You will be responsible for maintaining high service availability, implementing monitoring, and driving continuous improvements in our production environment.

Main missions of the unit you would join:
• Public Cloud offering: Design and implement the architecture of the public cloud infrastructure, ensuring it meets the evolving needs of customers and the business.
• Infrastructure operations: Operate and maintain the public cloud infrastructure together with cross-functional teams, ensuring its reliability, availability, and performance.
• Quality and Reliability: Continuously monitor and improve the quality and reliability of the public cloud infrastructure, ensuring high uptime and minimal disruptions.
• Security and Compliance: Ensure the security and compliance of the public cloud infrastructure, adhering to industry standards and regulatory requirements.
• Documentation & Standards: Maintain detailed documentation of processes, incident responses, and system architecture to uphold transparency and continuous learning.

After 6 months, you will:
• Understand the Landscape: develop a deep understanding of our OpenStack environment, internal processes and operational workflows.
• Establish Metrics: contribute to defining and refining key reliability metrics (SLIs, SLOs, and error budgets) tailored to our services.
• Engage in Incident Management: begin taking ownership of incident responses and participate in root cause analyses with cross-functional teams.

After 1 year, you will:
• Drive Strategic Initiatives: play an instrumental role in defining the long-term reliability roadmap, integrating new tools and practices to further stabilize our services.
• Lead Operational Excellence: own major reliability projects from conception to implementation, ensuring that our systems meet or exceed performance and uptime targets.

Required skills for this role:

Hard skills:
• Coding skills: proficiency in development using Python, Golang, Bash, or a similar language
• Infrastructure Management: hands-on experience managing and optimizing infrastructure as code (IaC) (Puppet, Ansible, Terraform, Kubernetes, Docker)
• Cloud knowledge: skilled in overseeing and maintaining cloud infrastructure
• Monitoring & automation: experience with monitoring, logging, and alerting systems (Prometheus, Grafana), combined with automating repetitive tasks
• SRE methodologies: expertise in applying SRE practices to maintain and monitor large software systems

Soft skills:
• Collaborative mindset: strong communication skills and the ability to work effectively within cross-functional and remote teams
• Performance tuning: strong analytical skills to interpret system metrics and optimize infrastructure performance
• Language Proficiency: Fluent in English

Does this offer not quite meet your expectations? Submit a spontaneous application on our candidate portal to join one of our teams!
It's a great opportunity to share your profile with our recruiters, get noticed, and potentially be contacted for a different opportunity.

Additional Information

  • Contract Type: Full-Time
  • Location: Wrocław