Senior Site Reliability Engineer (Data & Analytics Infrastructure)

site reliability engineering, SRE, scala, java, python, ruby, docker, kubernetes, mesos, chef, vertica, presto, druid, tableau, distributed systems, analytics infrastructure, automation, monitoring, incident management, CI/CD, infrastructure as code, linux, devops

Key details

Salary

Not specified

Employment type

Permanent Full Time

Seniority

Senior

Years of experience

5-10

Location

Grenoble, France; Paris, France

Full job description

Senior Site Reliability Engineer role in the Platform Core Analytics Infrastructure team. You will design, build, and operate large-scale, multi-datacenter analytic systems (Vertica, Presto, Druid, Tableau) with high reliability and low latency; build and maintain automation stacks using Chef, Mesos, Docker, and Kubernetes; and develop internal tools primarily in Scala, with some Python or Ruby. You will own the reliability lifecycle, including monitoring, troubleshooting, incident management, and continuous improvement, and will collaborate with product teams and contribute to the architectural vision.

Requirements: a Master's or PhD in Computer Science or equivalent experience; 5+ years in backend engineering, SRE, or DevOps; strong coding skills in Scala or Java; scripting in Python or Ruby; deep knowledge of distributed systems and analytic databases; Linux/UNIX experience; container and orchestration expertise; a strong SRE mindset; familiarity with CI/CD and infrastructure-as-code; excellent communication and teamwork skills; and willingness to join the on-call rotation.

The role follows a hybrid work model with offices in Paris and Grenoble. Benefits include career development, health and wellness support, an inclusive culture, and a competitive salary with performance rewards and potential equity.

What you'll do

Design, build, and evolve multi-datacenter, large-scale analytic systems focusing on performance, scalability, and resilience
Operate and improve core analytical data stores: Vertica, Presto, Druid, Tableau, and related systems
Ensure high reliability and low latency for systems accessed by diverse internal and external clients
Build and maintain automation stacks end-to-end using tools such as Chef, Mesos, Docker, Kubernetes
Develop internal management, deployment, and monitoring tools primarily in Scala, with opportunities to use Python or Ruby
Enhance self-service capabilities to empower engineering teams while maintaining high operational standards
Own the full lifecycle of reliability: monitoring, troubleshooting, on-call rotation, incident management, root cause analysis, and long-term fixes
Define and implement best practices for observability, automation, SLIs/SLOs, capacity management, and cost/performance optimization
Drive technical innovation, migrations (including to Kubernetes), and reliability improvements across platform components
Partner with product teams, data stakeholders, and platform engineers to ensure consistency, usability, and performance
Share technical vision, contribute architectural proposals, and influence system evolution across the organization
Document processes and systems extensively to improve onboarding and daily operations

Requirements

Master's or PhD in Computer Science or equivalent experience
5+ years in backend engineering, SRE, DevOps, or distributed systems
Strong coding skills in Scala, Java, or another statically typed language
Scripting experience in Python or Ruby
Deep understanding of distributed systems design, tuning, scalability, reliability
Experience with analytic databases and query optimization
Solid experience with Linux/UNIX environments
Hands-on knowledge of container technologies (Docker) and orchestration systems (Mesos, Kubernetes)
Strong SRE mindset with passion for automation, observability, reliability, documentation, and reducing toil
Familiarity with CI/CD pipelines, configuration management, and infrastructure-as-code
Excellent communication skills
Team player with proactive mindset
Comfortable working in high-scale environments and taking ownership of complex problems
Willingness to join on-call rotation

Tech stack

Scala, Java, Python, Ruby, Chef, Mesos, Docker, Kubernetes, Vertica, Presto, Druid, Tableau, Linux/UNIX

Benefits

Hybrid working model blending home and office
Learning, mentorship, and career development programs
Health benefits, wellness perks, and mental health support
Diverse, inclusive, and globally connected team
Attractive salary with performance-based rewards and family-friendly policies
Potential for equity depending on role and level
