Full job description
The Automotive Practice at Epsilon is seeking a senior software/data engineer with 10-14 years of experience to design, build, and scale data pipelines and lead architecture of modern data warehouse platforms (batch and real-time). The role requires expertise in Big Data ecosystems (Spark, Hadoop), proficiency in SQL and programming languages (Python, Scala, Java), and hands-on experience with AWS cloud services (EMR, Glue, S3, Lambda). Responsibilities include driving best practices in data engineering, collaborating with product and analytics teams, mentoring engineers, and ensuring scalable, reliable data solutions. Preferred experience includes modern data warehouses (Redshift, Snowflake, Databricks), MPP databases, ML/AI data pipelines, CI/CD, Terraform, Kubernetes, Docker, and knowledge of data governance and security.
What you'll do
- Design, build, and scale data pipelines to support enterprise reporting and analytics across large, complex datasets
- Lead architecture and implementation of modern data warehouse platforms (batch + real-time) to enable reliable, scalable reporting and BI use cases
- Drive adoption of best practices in data engineering, coding standards, and cloud infrastructure
- Partner with product, analytics, and engineering teams to deliver reliable data solutions
- Mentor engineers, conduct design/code reviews, and raise the technical bar across the team
Requirements
- 10–14 years of experience in software/data engineering with proven track record of impact
- Strong expertise in Big Data ecosystems (Spark, Hadoop, etc.)
- Proficiency in SQL, Python/Scala/Java for data processing
- Hands-on experience with cloud platforms (AWS preferred) – EMR, Glue, S3, Lambda, etc.
- Experience with data modeling, warehousing, and distributed systems
- Strong knowledge of enterprise DWH systems for reporting and analytics (e.g., schema design, query optimization, BI integration)
- Deep understanding of scalability, performance optimization, and fault tolerance
- Strong problem-solving, debugging, and system design skills
- Good to Have: Experience with Redshift or other modern data warehouse platforms (Snowflake, Databricks, BigQuery)
- Exposure to Greenplum or other MPP databases
- Exposure to ML/AI data pipelines or advanced analytics platforms
- Experience with CI/CD, Terraform, Kubernetes, Docker in data environments
- Knowledge of data governance, security, and compliance practices
Tech stack
SparkHadoopSQLPythonScalaJavaAWSEMRGlueS3LambdaRedshiftSnowflakeDatabricksBigQueryGreenplumML/AI data pipelinesCI/CDTerraformKubernetesDocker
Benefits
Employee well-being focusCollaborative work environmentOpportunities for growth through learning, development and career advancementInnovation-driven cultureWork-life balance and flexibility