Engineering2 months agoOn-site

Samba TV

Senior Software Engineer - Data Integration

seniorsoftware engineerdata engineeringdistributed systemsPythonSQLJavaScriptSparkDatabricksAWSGCPKafkaFlinkSpark StreamingApache AirflowdbtPrefectSnowflakeETLELTAPI designdata pipelinesdata privacyGDPRCCPAmachine learningdata integrationplatform engineeringmentorship

Key details

Salary

Not specified

Employment type

Full-time

Seniority

Senior

Years experience

5-10

Location

Amsterdam, Netherlands

Full job description

Senior Software Engineer role on the Data Integration team in Amsterdam. Responsibilities include designing, building, and operating production-grade data systems for ingestion, transformation, and distribution of large-scale datasets. Requires 8+ years experience in software engineering focused on data engineering, backend systems, or distributed data infrastructure. Must be proficient in Python, SQL, and comfortable with JavaScript. Experience with distributed processing frameworks (Spark, Databricks), cloud platforms (AWS, GCP), streaming/event-driven frameworks (Kafka, Flink), workflow orchestration tools (Apache Airflow, dbt, Prefect), and API-first service design is required. Knowledge of data privacy regulations (GDPR, CCPA) and experience building compliant systems is necessary. Role involves technical leadership, mentoring, cross-team collaboration, operational ownership including on-call duties, and driving improvements in CI/CD and deployment processes. Preferred experience includes data warehousing/lakehouse technologies (Snowflake) and exposure to ad tech or digital media concepts.

What you'll do

Design, build, and maintain reliable data pipelines for ingestion, transformation, and distribution of large-scale datasets, processing high volumes efficiently in production
Develop ETL/ELT workflows using distributed computing frameworks on cloud infrastructure
Design and build API-first services exposing ingestion, processing, and distribution capabilities to internal teams and external consumers
Implement data quality validation, monitoring, and observability for owned components
Contribute to platform-grade, reusable components enabling downstream teams and self-service consumption
Take end-to-end ownership of key components within the data integration platform, driving reliability, scalability, and evolution
Build new partner and destination integrations end-to-end, including throughput tuning and operational handoff
Design and implement privacy-compliant data handling practices applying GDPR, CCPA, and company data governance policies
Engage cross-functional stakeholders to ensure systems support all downstream use cases
Drive technical design for components, produce design documents, contribute to architecture discussions, and align the team
Conduct rigorous code reviews and uphold high standards for code quality, testability, and maintainability
Mentor engineers through feedback, pairing, and design review
Collaborate across adjacent teams, advocate for shared standards
Own reliability of components end-to-end, monitor health, respond to incidents, and follow through on improvements
Participate in on-call rotations and improve operational practices
Drive improvements to CI/CD pipelines, deployment processes, and testing coverage

Requirements

8+ years of professional software engineering experience, with a strong focus on data engineering, backend systems, or distributed data infrastructure
Proficient in Python and SQL; comfortable with JavaScript in full-stack or API contexts
Strong hands-on experience with distributed processing frameworks (e.g., Spark, Databricks, or equivalent) working with large-scale datasets in production
Practical experience across AWS, GCP, and Databricks, and their core data services
Strong platform-thinking and API-first service design experience, building components that are reusable and consumable by downstream teams
Hands-on experience with streaming and event-driven data processing frameworks (e.g., Kafka, Flink, Spark Streaming, or equivalent)
Experience building or operating multi-tenant platforms with strong isolation and security boundaries
Hands-on experience with workflow orchestration tools (Apache Airflow, dbt, Prefect, or equivalent)
Experience incorporating AI and machine learning capabilities into production data workflows
Solid understanding of data privacy regulations (GDPR, CCPA) and practical experience building compliant systems
Clear communicator and cross-functional collaborator, able to articulate technical decisions and engage constructively in design reviews
Active mentor, invests in others, gives direct feedback, and cares about raising the bar for the team
Preferred: Strong familiarity with data warehousing and lakehouse technologies, with a preference for Snowflake
Preferred: Exposure to ad tech, audience activation, data licensing, or digital media, familiarity with device graphs, audience segmentation, identity resolution, or measurement

Tech stack

PythonSQLJavaScriptSparkDatabricksAWSGCPKafkaFlinkSpark StreamingApache AirflowdbtPrefectSnowflake

Apply now

Ready to take the next step in your career? Click the button below to continue to the application process.

Continue to application Browse more jobs

Company

Samba TV

Television remains a vibrant cultural influence and an essential source of entertainment and information worldwide. Tremendous growth in content choices, and viewing platforms that allow us to watch anything, anytime, on any screen, has actually made it harder for viewers to discover and keep up with all the great programming available. It’s also more competitive for content providers to keep your attention, and for marketers to make strong, measurable connections with their target consumers. Technology that improves the viewing experience, enables content discovery, and addresses audience fragmentation across screens will strengthen television’s business model and relevance to consumers. Data is at the center of any solution to make TV better. Samba TV's technology is built into Smart TVs and easily maps to smart phones and tablets. By recognizing what's on screen, Samba TV learns what viewers like and using machine learning algorithms, enables discovery of shows and actors in a whole new way. Likewise, our data and measurement products are transforming the way stakeholders across the media landscape are thinking about their business. Given the dramatic growth in streaming services, connected devices, time-shifting, and multi-screen viewership, our data products solve real problems and create a meaningful competitive advantage for our clients.

Industry

Technology, Information and Internet

Company size

201-500

Website

http://samba.tv

Posted

2 months ago

Category: Engineering

Similar jobs

More roles worth a look

Related opportunities based on specialty and working model so candidates can keep momentum.

TripleLift

Data Scientist

New York, US•2 months ago

$90K – $120K

data sciencemachine learningpython

View job details→

TripleLift

Director of Sales - US, West

Los Angeles, United States•2 months ago

$290K – $350K

sales leadershipprogrammaticCTV

View job details→

TripleLift

Director, Product Management

New York, US•2 months ago

$200K – $250K

product managementCTVprogrammatic

View job details→