Full job description
Senior Software Engineer role in the Cleanroom team responsible for delivering large-scale cloud-native data platforms on AWS using Databricks and distributed processing frameworks. Requires hands-on work with Python, PySpark, Apache Spark, Databricks, AWS services, event-driven architectures, and SQL/NoSQL databases. Responsibilities include partnering with global teams, owning the software development lifecycle, designing scalable components, participating in architecture and code reviews, and mentoring junior engineers. Qualifications include a degree in Computer Science or related field, 5-8 years of software engineering experience with expertise in data engineering and distributed systems, strong skills in Databricks, Python, PySpark, Apache Spark, AWS services, messaging technologies, databases, testing strategies, and CI/CD/DevOps practices. Nice to have certifications in AWS and Databricks, experience with Azure/GCP, and exposure to generative AI technologies.
What you'll do
- Deliver large-scale cloud-native data platforms primarily on AWS using Databricks and distributed processing frameworks
- Work hands-on across the technology stack including Python, PySpark, Apache Spark, Databricks, AWS services, event-driven architectures, and SQL/NoSQL databases
- Partner with global engineering, product management, architecture, and business stakeholders
- Own the end-to-end software development lifecycle including requirements gathering, solution design, development, deployment, observability, and documentation
- Design and develop reusable, maintainable and scalable components
- Participate in architecture discussions, technical design reviews, and code reviews
- Mentor and guide junior engineers
- Ensure the Cleanroom platform scales securely and reliably to support sustained business growth
Requirements
- B.E/B.Tech/M.Tech/MCA in Computer Science, Information Technology or related field
- 5-8 years of strong software engineering experience
- Significant expertise in large-scale data engineering and distributed systems architecture
- Experience in Data Warehousing, Data Lakes, Delta Lake architecture, and modern big data ecosystem designs
- Deep hands-on expertise in Databricks, Python, PySpark, and Apache Spark
- Strong experience with AWS services such as S3, Glue, Athena and Lambda
- Experience with messaging technologies such as Kafka and RabbitMQ
- Strong expertise in relational and NoSQL databases including SQL Server and MongoDB
- Experience implementing robust testing strategies including unit, integration, and regression testing
- Strong understanding of CI/CD and DevOps practices using Jenkins, GitHub/GitLab, Bitbucket, and automated deployment pipelines
- Strong critical thinking and analytical skills
- Nice to have: AWS and Databricks certifications
- Nice to have: Experience working with Azure and/or Google Cloud Platform (GCP)
- Nice to have: Exposure to Generative AI technologies including LLMs, RAG architectures, and Agentic AI systems
Tech stack
AWSDatabricksPythonPySparkApache SparkSQLNoSQLKafkaRabbitMQSQL ServerMongoDBJenkinsGitHubGitLabBitbucketS3GlueAthenaLambda
Benefits
Employee well-being focusCollaborative work environmentOpportunities for growth through learning, development and career advancementInnovation-driven cultureWork-life balance and flexibilityDiversity, inclusion, and equal employment opportunities