Full job description
The Principal Software Engineer - Ad Tech & Distributed Systems leads reliability, performance, and operational excellence of FreeWheel platforms. Responsibilities include owning production reliability, designing and operating monitoring and alerting systems, leading incident response and root cause analysis, supporting live events, driving automation, influencing architecture for production readiness, managing change and capacity planning, championing security practices, enforcing engineering operations standards, and participating in on-call rotations. Requires 10+ years software engineering experience, 5+ years with AWS, expertise in distributed systems, strong coding/debugging skills in Python, Go-Lang, or Scala, knowledge of Linux and cloud infrastructure, proficiency with infrastructure-as-code and CI/CD tools, and strong communication skills. Bachelor's degree or equivalent experience required. Salary range $152,828.79 - $229,243.19.
What you'll do
- Own production reliability, availability, latency, and performance of large-scale, mission-critical systems
- Design, implement, and operate monitoring, alerting, and observability solutions to ensure system health and rapid detection of issues
- Lead incident response, root cause analysis, and post-incident reviews to drive long-term reliability improvements
- Support and ensure stable operations during high-visibility, time-sensitive live events and releases
- Drive automation initiatives to reduce operational toil, improve efficiency, and increase system resilience
- Partner with software engineering teams to influence architecture and design decisions with production readiness in mind
- Lead and execute change management, capacity planning, and production readiness reviews
- Champion security, vulnerability management, and secure configuration practices across production environments
- Enforce and continuously improve Engineering Operations processes, standards, and best practices
- Participate in on-call rotations, including weekend coverage, and provide escalation support for complex production issues
Requirements
- 10+ years of professional experience in software development/engineering
- 5+ years experience with AWS
- Expert-level coding, debugging, and troubleshooting skills across complex, distributed production systems
- Proven ability to lead and mentor engineers in automation, reliability engineering, and production problem-solving
- Strong experience designing and operating server-side applications or services using Python, Go-Lang, or Scala
- Experience developing, operating, and troubleshooting distributed systems and backend services
- Familiarity with data processing platforms, data pipelines, and large-scale system architectures
- Deep knowledge of Linux systems, system internals, networking, and production infrastructure
- Extensive experience with AWS cloud architecture and services including VPC, subnets, NACLs, security groups, EC2, S3, IAM, Route 53, Lambda, and related services
- Proficiency with infrastructure-as-code and configuration management tools and practices
- Mastery of CI/CD and SDLC tools (Docker, Kubernetes, Jenkins, Git, Ansible, Chef, and Puppet)
- Strong understanding of database technologies, SQL, performance tuning, and operational data management
- Advanced analytical and data-driven problem-solving skills, including use of metrics to guide decisions
- Strong communication skills, attention to detail, adaptability, and ability to work effectively within a global, cross-functional team
- Bachelor's Degree or equivalent combination of coursework and experience
Tech stack
AWSPythonGo-LangScalaLinuxVPCsubnetsNACLssecurity groupsEC2S3IAMRoute 53Lambdainfrastructure-as-codeDockerKubernetesJenkinsGitAnsibleChefPuppetSQLC++
Benefits
Base pay within range $152,828.79 - $229,243.19 dependent on experienceBonus eligibility for non-sales positionsComprehensive benefits supporting physical, financial, and emotional well-beingCommission eligibility for sales positionsEqual opportunity employer