Full job description
Senior Site Reliability Engineer role in the Platform Engineering team. Responsibilities include owning and improving production services lifecycle, partnering with development teams for system design and readiness, automating operations to enhance reliability and performance, maintaining live systems with monitoring and incident management, participating in incident response and on-call rotation, developing tooling and libraries, supporting observability and performance initiatives, and leading technical migrations. Required experience: 5+ years in SRE, Platform Engineering, Software Engineering, or DevOps with strong software engineering skills in at least one modern programming language. Must have experience with large-scale distributed systems, systems engineering fundamentals, debugging production issues, and automation of operational workflows. Benefits include hybrid work model, career development, health and wellness support, inclusive team environment, competitive salary with performance rewards, and potential equity. Locations: Paris and Grenoble, France.
What you'll do
- Own and improve the full lifecycle of production services from design and deployment to operation and continuous improvement
- Partner with development teams before launch through system design reviews, platform and framework development, capacity planning, and production readiness assessments
- Improve reliability, scalability, and performance by automating operations and driving infrastructure and platform enhancements
- Maintain and optimize live systems through monitoring, observability, performance analysis, and incident management
- Participate in incident response and contribute to a culture of blameless postmortems and continuous learning
- Develop and maintain tooling and libraries (Python, Jenkins, Chef)
- Support observability and performance initiatives
- Lead technical migrations across infrastructure and core dependencies
- Participate in an on-call rotation to help ensure production stability
Requirements
- Strong software engineering experience with at least one modern programming language
- 5+ years of experience in Site Reliability Engineering, Platform Engineering, Software Engineering, or DevOps roles
- Experience designing, operating, and troubleshooting large-scale distributed systems in production environments
- Solid understanding of systems engineering fundamentals, including compute, networking, storage, and observability
- Hands-on experience debugging production issues, optimizing system performance, and automating operational workflows
- Ability to write clean, maintainable, and reliable code used in production systems and internal platforms
- Strong analytical and problem-solving skills, with the ability to collaborate effectively across engineering teams
- Curiosity, ownership, and a pragmatic mindset toward improving reliability and developer productivity
Tech stack
C#JavaScalaPythonGoPrometheusGrafanaKibanaLinuxKubernetesMesosJenkinsChef
Benefits
Hybrid work model blending home and in-office experiencesLearning, mentorship & career development programsHealth benefits, wellness perks & mental health supportDiverse, inclusive, and globally connected teamAttractive salary with performance-based rewards and family-friendly policiesPotential for equity depending on role and level