AdTechTalent
Engineering7 days agoOn-site

Samba TV

Senior Software Engineer - Data Integration

seniorsoftware engineerdata engineeringdistributed systemsPythonSQLJavaScriptSparkDatabricksAWSGCPKafkaFlinkSpark StreamingApache AirflowdbtPrefectSnowflakeETLELTAPI designdata pipelinesdata privacyGDPRCCPAmachine learningdata integrationplatform engineeringmentorship

Key details

Salary

Not specified

Employment type

Full-time

Seniority

Senior

Years experience

5-10

Location

Amsterdam, Netherlands

Full job description

Senior Software Engineer role on the Data Integration team in Amsterdam. Responsibilities include designing, building, and operating production-grade data systems for ingestion, transformation, and distribution of large-scale datasets. Requires 8+ years experience in software engineering focused on data engineering, backend systems, or distributed data infrastructure. Must be proficient in Python, SQL, and comfortable with JavaScript. Experience with distributed processing frameworks (Spark, Databricks), cloud platforms (AWS, GCP), streaming/event-driven frameworks (Kafka, Flink), workflow orchestration tools (Apache Airflow, dbt, Prefect), and API-first service design is required. Knowledge of data privacy regulations (GDPR, CCPA) and experience building compliant systems is necessary. Role involves technical leadership, mentoring, cross-team collaboration, operational ownership including on-call duties, and driving improvements in CI/CD and deployment processes. Preferred experience includes data warehousing/lakehouse technologies (Snowflake) and exposure to ad tech or digital media concepts.

What you'll do

  • Design, build, and maintain reliable data pipelines for ingestion, transformation, and distribution of large-scale datasets, processing high volumes efficiently in production
  • Develop ETL/ELT workflows using distributed computing frameworks on cloud infrastructure
  • Design and build API-first services exposing ingestion, processing, and distribution capabilities to internal teams and external consumers
  • Implement data quality validation, monitoring, and observability for owned components
  • Contribute to platform-grade, reusable components enabling downstream teams and self-service consumption
  • Take end-to-end ownership of key components within the data integration platform, driving reliability, scalability, and evolution
  • Build new partner and destination integrations end-to-end, including throughput tuning and operational handoff
  • Design and implement privacy-compliant data handling practices applying GDPR, CCPA, and company data governance policies
  • Engage cross-functional stakeholders to ensure systems support all downstream use cases
  • Drive technical design for components, produce design documents, contribute to architecture discussions, and align the team
  • Conduct rigorous code reviews and uphold high standards for code quality, testability, and maintainability
  • Mentor engineers through feedback, pairing, and design review
  • Collaborate across adjacent teams, advocate for shared standards
  • Own reliability of components end-to-end, monitor health, respond to incidents, and follow through on improvements
  • Participate in on-call rotations and improve operational practices
  • Drive improvements to CI/CD pipelines, deployment processes, and testing coverage

Requirements

  • 8+ years of professional software engineering experience, with a strong focus on data engineering, backend systems, or distributed data infrastructure
  • Proficient in Python and SQL; comfortable with JavaScript in full-stack or API contexts
  • Strong hands-on experience with distributed processing frameworks (e.g., Spark, Databricks, or equivalent) working with large-scale datasets in production
  • Practical experience across AWS, GCP, and Databricks, and their core data services
  • Strong platform-thinking and API-first service design experience, building components that are reusable and consumable by downstream teams
  • Hands-on experience with streaming and event-driven data processing frameworks (e.g., Kafka, Flink, Spark Streaming, or equivalent)
  • Experience building or operating multi-tenant platforms with strong isolation and security boundaries
  • Hands-on experience with workflow orchestration tools (Apache Airflow, dbt, Prefect, or equivalent)
  • Experience incorporating AI and machine learning capabilities into production data workflows
  • Solid understanding of data privacy regulations (GDPR, CCPA) and practical experience building compliant systems
  • Clear communicator and cross-functional collaborator, able to articulate technical decisions and engage constructively in design reviews
  • Active mentor, invests in others, gives direct feedback, and cares about raising the bar for the team
  • Preferred: Strong familiarity with data warehousing and lakehouse technologies, with a preference for Snowflake
  • Preferred: Exposure to ad tech, audience activation, data licensing, or digital media, familiarity with device graphs, audience segmentation, identity resolution, or measurement

Tech stack

PythonSQLJavaScriptSparkDatabricksAWSGCPKafkaFlinkSpark StreamingApache AirflowdbtPrefectSnowflake

Apply now

This MVP uses a placeholder application flow. In production, this section can connect to an external apply URL or a native application form.

Similar jobs

More roles worth a look

Related opportunities based on specialty and working model so candidates can keep momentum.