AdTechTalent
Engineering26 days agoRemote

Sovrn

Principal Data Engineer

pythonpysparkkafkadatabricksawssnowflakellmragvector dbadtechdata engineeringprogrammaticaiagentic engineeringdata platformstreamingbig datadata governancementorship

Key details

Salary

Not specified

Employment type

Full-time

Seniority

Senior

Years experience

10+

Location

Boulder, Colorado, United States; New York City, New York, United States; Remote, United States

Full job description

Sovrn is hiring a Principal Software Engineer (Data) with expertise in adtech data infrastructure and AI-native data engineering. This senior role involves architectural ownership and technical leadership focused on Sovrn’s Data Collective. The candidate will lead design and evolution of high-throughput, real-time streaming and batch data platform systems, drive AI engineering practices including LLM integration and agentic workflows, and mentor engineering teams. Required skills include 10+ years software engineering experience, 5+ years in adtech data infrastructure, deep knowledge of programmatic ecosystems, real-time streaming, big data platforms (AWS, Snowflake, Databricks), data governance, and AI/LLM production experience. The role is based in Boulder, Colorado or New York City, with hybrid remote options in select states. Compensation ranges from $200,000 to $240,000 annually plus bonus, equity, and benefits including medical, dental, vision, 401(k), paid leave, and more.

What you'll do

  • Own the design and evolution of data platform systems that operate at exchange scale; high throughput, real-time streaming, and always-on batch pipelines
  • Lead architectural decisions across data infrastructure: pipeline design, data modeling, lakehouse architecture, and data services layers
  • Specify data platform components and configurations required for pipeline implementation; define pipeline observability to understand and improve performance at massive scale
  • Research, implement, and evolve methods to process and democratize data across the organization
  • Drive technical standards, design reviews, and engineering best practices across a senior team
  • Partner with product, data science, and platform teams to ship end-to-end
  • Establish and champion AI engineering practices across the team, from prompt engineering and RAG patterns to agentic workflow design, LLM evaluation, and progressive implementation of agentic design patterns
  • Identify high-leverage opportunities to apply AI in the data stack: intelligent pipeline optimization, anomaly detection, automated data quality, forecasting, and LLM-powered data services
  • Lead the evolution of existing LLM and agentic tooling from passive use to intentional, well-architected integration within the data platform
  • Set standards for evaluating, trusting, and operating AI-powered systems in production, including observability, fallback behavior, and model governance
  • Help the broader engineering team build fluency and confidence with AI tooling
  • Provide domain expertise across the organization to enable business growth through data services and data models
  • Provide counsel to all consumers and stakeholders of data to enable efficient and impactful use of data assets
  • Mentor and level up engineers through code review, design collaboration, and hands-on guidance; foster a culture of innovation and continuous learning
  • Operate with high autonomy across ambiguous, high-impact problems

Requirements

  • 10+ years of software engineering experience, with a strong data engineering and backend track record
  • 5+ years working specifically in adtech data infrastructure, SSP, DSP, exchange, or ad server environments
  • Deep fluency in the programmatic ecosystem: OpenRTB, bid request/response flows, auction mechanics, supply path optimization, or similar
  • Excellent understanding of real-time streaming and batch pipelines, big data, and data lakes; hands-on experience in distributed data processing in the AWS ecosystem
  • Strong understanding of second-layer big data platforms such as Snowflake and Databricks, applicable use cases, best practices, implementation, and support considerations
  • Strong experience in structured, unstructured, and semi-structured data techniques; metadata management, data lineage, and data governance
  • Experience with data security and compliance (PII, CCPA, GDPR, etc.)
  • Demonstrated experience leading AI or agentic engineering efforts in production environments; not just experimentation, but shipped, operated, and iterated on
  • Hands-on experience with LLM integration patterns: RAG, vector DBs, tool use, multi-step agentic workflows, prompt engineering, and evaluation frameworks
  • Ability to clearly document and communicate architectural concepts at multiple levels; ability to lead problem definition, solution designs, and implementation work plans
  • Understanding of DevOps and SRE practices
  • Comfort driving technical decisions in ambiguous, fast-moving environments
  • A point of view on where AI is and isn’t the right tool, and the credibility to make that case

Tech stack

PythonPySparkKafkaDatabricksAWSSnowflakeLLMRAGvector DBs

Benefits

Competitive salariesStock optionsMedical, dental, and vision coverageShort and long-term disabilityLife insurance11 paid holidaysFlexible vacationCommuter benefits401(k) plan and matchPaid parental leave programBonus and equity

Apply now

This MVP uses a placeholder application flow. In production, this section can connect to an external apply URL or a native application form.

Similar jobs

More roles worth a look

Related opportunities based on specialty and working model so candidates can keep momentum.