Senior Software Engineer

databricks, hadoop, spark, pyspark, hive, sql, delta lake, aws, data engineering, etl, performance tuning, big data, data pipelines, jira, git, bitbucket

Key details

Salary

Not specified

Employment type

Full-time

Seniority

Mid-level

Years experience

5-10

Location

Bengaluru, India

Full job description

The Data Engineering team at Epsilon is seeking a mid-level Data Engineer with 5-8 years of experience to develop, unit test, and implement data applications primarily in the Hadoop ecosystem and Databricks. The role involves designing, building, and optimizing scalable data pipelines and ETL frameworks using Spark, PySpark, Hive, SQL, and Delta Lake. Candidates must have hands-on experience with Databricks, AWS, performance tuning for large datasets, and tools like JIRA and GIT/Bitbucket. Responsibilities include collaborating with technical leads and architects, contributing to data modeling and governance, troubleshooting production issues, and improving deployment processes. The position is based in Bengaluru, Karnataka, India.

What you'll do

Design, develop, and maintain data pipelines and ETL frameworks using Spark, PySpark, Hive, and SQL
Design and develop scalable data pipelines and processing frameworks on Databricks using PySpark, SQL, and Delta Lake
Develop efficient, reusable, and reliable code for data processing and transformation
Optimize and tune Spark jobs for performance and scalability
Work with Technical Leads, Architects and data platform teams to implement robust data solutions
Contribute to data modeling, quality, and governance initiatives
Perform detailed analysis, troubleshooting, and root cause analysis for production issues
Participate in code reviews and enforce best coding and design practices
Collaborate with cross-functional teams to deliver high-quality software solutions
Resolve deployment challenges and optimize deployment processes to help deliver reliable solutions
Interact with technical leads and architects to solve challenges faced by Data Engineering teams
Contribute to continuous improvement of development and delivery processes

Requirements

BE / B.Tech / MCA (correspondence courses not accepted)
5-8 years of experience
Hands-on experience in building and optimizing data solutions on the Hadoop ecosystem leveraging PySpark
Good knowledge and experience with Databricks
Experience working with AWS
Experience with performance tuning for large data sets
Experience with JIRA for user-story/bug tracking
Experience with GIT/Bitbucket

Tech stack

Hadoop, Databricks, Spark, PySpark, Hive, SQL, Delta Lake, AWS, JIRA, GIT, Bitbucket

Benefits

Opportunity to design, build, and optimize large-scale data solutions
Exposure to diverse data engineering challenges
Hands-on experience with modern data platforms, cloud ecosystems, and automation frameworks
Collaborative and agile work environment
Being part of a global Data team impacting data-driven decision-making
Endless opportunities for growth through learning, development and career advancement
Focus on employee well-being
Work-life balance and flexibility
Diversity and inclusion commitment
