Full job description
Senior Staff Software Engineer role in the Cleanroom team responsible for architecture, technical strategy, and execution of large-scale cloud-native data platforms primarily on AWS, with exposure to Azure and GCP. Requires hands-on expertise in Python, PySpark, Apache Spark, Databricks, AWS services, event-driven architectures, and SQL/NoSQL databases. Lead technical initiatives on performance, scalability, reliability, security, governance, and cost efficiency. Own full software development lifecycle and mentor engineers. Requires 12-16 years experience in software engineering, strong expertise in data engineering, distributed systems, data warehousing, data lakes, and big data ecosystems. Experience with AWS services, real-time streaming technologies, relational and NoSQL databases, Infrastructure as Code tools, CI/CD and DevOps practices. Exposure to Generative AI technologies is required. Nice to have AWS/Databricks certifications, Azure/GCP experience, TypeScript, Node.js, UX design, and cleanroom environment experience.
What you'll do
- Architect, design, and deliver large-scale cloud-native data platforms primarily on AWS
- Work hands-on with Python, PySpark, Apache Spark, Databricks, AWS services, event-driven architectures, SQL/NoSQL databases
- Lead technical initiatives on performance, scalability, reliability, security, governance, and cost efficiency
- Review and influence architectural decisions and establish engineering best practices
- Partner with global engineering, product management, architecture, and business stakeholders
- Own end-to-end software development lifecycle including requirements, design, development, deployment, observability, and documentation
- Mentor and guide senior and junior engineers
- Foster a culture of innovation, accountability, collaboration, and technical excellence
Requirements
- B.E/B.Tech/M.Tech/MCA in Computer Science, Information Technology or related field
- 12-16 years of software engineering experience
- Expertise in large-scale data engineering and distributed systems architecture
- Expertise in Data Warehousing, Data Lakes, Delta Lake architecture
- Hands-on experience with Databricks, Python, PySpark, Apache Spark
- Experience with AWS services (S3, Glue, Redshift, EMR, Athena, Lambda, EventBridge)
- Experience with real-time streaming technologies (Kafka, AWS Kinesis, SQS, RabbitMQ)
- Knowledge of relational and NoSQL databases
- Experience with Infrastructure as Code tools (Terraform, Ansible)
- Understanding of CI/CD and DevOps practices
- Ability to lead complex engineering programs and influence technical direction
- Exposure to Generative AI technologies and AI-driven solutions
- Nice to have: AWS and Databricks certifications, experience with Azure/GCP
- Nice to have: Working knowledge of TypeScript, Node.js, UX design & development
- Experience building data platforms in privacy-safe or cleanroom environments
Tech stack
AWSAzureGCPDatabricksPythonPySparkApache SparkSQLNoSQLPostgreSQLSQL ServerAuroraRDSMongoDBDynamoDBTerraformAnsibleJenkinsGitHubGitLabBitbucketGoCDKafkaAWS KinesisSQSRabbitMQTypeScriptNode.jsUX designGenerative AILLMsRAG architecturesAgentic AI
Benefits
Employee well-being focusCollaborative work environmentOpportunities for growth, learning, and career advancementInnovation-driven cultureWork-life balance and flexibilityDiversity and inclusion commitment