AdTechTalent
Engineering · 13 days ago · Hybrid

Criteo

Senior Site Reliability Engineer (Data & Analytics Infrastructure)

site reliability engineering, SRE, scala, java, python, ruby, docker, kubernetes, mesos, chef, vertica, presto, druid, tableau, distributed systems, analytics infrastructure, automation, monitoring, incident management, CI/CD, infrastructure as code, linux, devops

Key details

Salary

Not specified

Employment type

Permanent Full Time

Seniority

Senior

Years experience

5-10

Location

Grenoble, France; Paris, France

Full job description

Senior Site Reliability Engineer role in the Platform Core Analytics Infrastructure team. You will design, build, and operate large-scale, multi-datacenter analytic systems (Vertica, Presto, Druid, Tableau) with high reliability and low latency. You will build and maintain automation stacks using Chef, Mesos, Docker, and Kubernetes, and develop internal tools primarily in Scala, with some Python or Ruby. You will own the reliability lifecycle, including monitoring, troubleshooting, incident management, and continuous improvement, while collaborating with product teams and contributing to the architectural vision.

Requirements: a Master's or PhD in Computer Science or equivalent experience; 5+ years in backend engineering, SRE, or DevOps; strong coding skills in Scala or Java; scripting in Python or Ruby; deep knowledge of distributed systems and analytic databases; Linux/UNIX experience; container and orchestration expertise; a strong SRE mindset; familiarity with CI/CD and infrastructure-as-code; excellent communication and teamwork skills; and willingness to join the on-call rotation.

The role follows a hybrid work model with offices in Paris and Grenoble. Benefits include career development, health and wellness support, an inclusive culture, and a competitive salary with performance rewards and potential equity.

What you'll do

  • Design, build, and evolve large-scale, multi-datacenter analytic systems with a focus on performance, scalability, and resilience
  • Operate and improve core analytical data stores: Vertica, Presto, Druid, Tableau, and related systems
  • Ensure high reliability and low latency for systems accessed by diverse internal and external clients
  • Build and maintain automation stacks end-to-end using tools such as Chef, Mesos, Docker, Kubernetes
  • Develop internal management, deployment, and monitoring tools primarily in Scala, with opportunities to use Python or Ruby
  • Enhance self-service capabilities to empower engineering teams while maintaining high operational standards
  • Own the full lifecycle of reliability: monitoring, troubleshooting, on-call rotation, incident management, root cause analysis, and long-term fixes
  • Define and implement best practices for observability, automation, SLIs/SLOs, capacity management, and cost/performance optimization
  • Drive technical innovation, migrations (including to Kubernetes), and reliability improvements across platform components
  • Partner with product teams, data stakeholders, and platform engineers to ensure consistency, usability, and performance
  • Share technical vision, contribute architectural proposals, and influence system evolution across the organization
  • Document processes and systems extensively to improve onboarding and daily operations

Requirements

  • Master's or PhD in Computer Science or equivalent experience
  • 5+ years in backend engineering, SRE, DevOps, or distributed systems
  • Strong coding skills in Scala, Java, or another statically typed language
  • Scripting experience in Python or Ruby
  • Deep understanding of distributed systems design, tuning, scalability, and reliability
  • Experience with analytic databases and query optimization
  • Solid experience with Linux/UNIX environments
  • Hands-on knowledge of container technologies (Docker) and orchestration systems (Mesos, Kubernetes)
  • Strong SRE mindset with a passion for automation, observability, reliability, documentation, and reducing toil
  • Familiarity with CI/CD pipelines, configuration management, and infrastructure-as-code
  • Excellent communication skills
  • Team player with a proactive mindset
  • Comfortable working in high-scale environments and taking ownership of complex problems
  • Willingness to join the on-call rotation

Tech stack

Scala, Java, Python, Ruby, Chef, Mesos, Docker, Kubernetes, Vertica, Presto, Druid, Tableau, Linux/UNIX

Benefits

  • Hybrid working model blending home and office
  • Learning, mentorship, and career development programs
  • Health benefits, wellness perks, and mental health support
  • Diverse, inclusive, and globally connected team
  • Attractive salary with performance-based rewards and family-friendly policies
  • Potential for equity depending on role and level
