AdTechTalent
Engineering | 6 days ago | Hybrid

Samba TV

Junior Software Engineer - Data Integration

python, sql, aws, gcp, airflow, dbt, spark, databricks, kafka, data engineering, data pipelines, etl, elt, distributed computing, gdpr, ccpa, software engineering, backend, data infrastructure

Key details

Salary: Not specified
Employment type: Full-time
Seniority: Entry
Years of experience: 0-2
Location: Amsterdam, Netherlands

Full job description

This is a Junior Software Engineer role on the Data Integration team in Amsterdam. The team builds and operates platforms for data ingestion, processing, enrichment, and distribution.

Responsibilities include building and maintaining production-grade data systems, contributing to data pipelines, building ETL/ELT workflows on cloud infrastructure, implementing and testing APIs, writing automated tests, applying data governance requirements (GDPR, CCPA), collaborating with the team, and participating in code reviews. The role also carries operational ownership: on-call rotations, monitoring and alerting, and improving CI/CD pipelines.

Requirements are 2+ years of software engineering experience with a relevant degree or equivalent, proficiency in Python and SQL, experience with AWS or GCP data services, experience with data pipeline tools such as Airflow or dbt, familiarity with data privacy regulations, and strong communication and collaboration skills. Experience with distributed processing frameworks (Spark, Databricks), streaming or event-driven systems (Kafka), and ad tech or digital media is preferred.

Benefits include health insurance, wellness offerings, life and disability insurance, a retirement savings plan, paid holidays and PTO, and bonus incentives.

What you'll do

  • Contribute to data pipelines for ingestion, transformation, and distribution of datasets, implementing features that require analysis across a variety of factors
  • Build ETL/ELT workflows using distributed computing frameworks on cloud infrastructure, applying team standards and adapting approaches where needed
  • Implement and test API endpoints that expose ingestion, processing, and distribution capabilities, with a focus on correctness and clear contracts
  • Write automated tests and contribute to data quality validation and monitoring for pipeline components
  • Apply Samba's platform patterns to produce reliable, maintainable code that fits within the broader system
  • Implement partner and destination integrations against established patterns
  • Apply privacy and data governance requirements (GDPR, CCPA) to the code, following established team patterns and escalating questions appropriately
  • Collaborate closely with the team and reason about how work fits within the broader data platform
  • Participate in code reviews as author and reviewer
  • Write clear, well-tested, maintainable code that meets the team's quality standards
  • Communicate proactively about blockers, progress, and questions, and engage constructively in team discussions
  • Participate in on-call rotations and contribute to improving operational practices
  • Implement monitoring and alerting for owned components
  • Help maintain and improve CI/CD pipelines and testing coverage for team systems

Requirements

  • 2+ years of professional software engineering experience with a Bachelor's degree in Computer Science, Software Engineering, or a related technical field (or an advanced degree without prior experience, or equivalent)
  • Experience in data engineering, backend systems, or distributed data infrastructure
  • Proficiency in Python and SQL; ability to write clean, tested, production-ready code
  • Hands-on experience with AWS or GCP and their core data services
  • Experience with data pipeline tools or workflow orchestration (e.g., Airflow, dbt, or equivalent) in a production environment
  • Familiarity with data privacy regulations (GDPR, CCPA)
  • Ability to communicate clearly, ask good questions, and work collaboratively within a team
  • Preferred: Exposure to distributed processing frameworks (e.g., Spark, Databricks, or equivalent)
  • Preferred: Familiarity with streaming or event-driven data systems (e.g., Kafka or equivalent)
  • Preferred: Exposure to ad tech, audience activation, data licensing, or digital media

Tech stack

Python, SQL, AWS, GCP, Airflow, dbt, Spark, Databricks, Kafka

Benefits

  • Health insurance
  • Wellness offerings
  • Life and disability insurance
  • Retirement savings plan
  • Paid holidays
  • Paid time off (PTO)
  • Bonuses, short-term incentives, and long-term incentives

