Full job description
PubMatic is hiring a Release Operations Engineer responsible for planning and executing software releases across environments. The role involves managing application deployments and configurations, monitoring post-release performance, troubleshooting incidents, and maintaining deployment documentation. Candidates should have 1-5 years of experience in Release Operations, CI/CD, configuration/change management, Linux/Unix administration, scripting (Shell, Python, Ansible), Docker, Kubernetes, configuration management tools (Ansible, Chef, Puppet, Terraform), monitoring tools (Nagios, ELK, PagerDuty), networking, disaster recovery, and cloud infrastructure (AWS, GCP). The role requires strong communication and problem-solving skills, and the ability to work independently and in multicultural teams. Responsibilities include server setup, security patching, and 24/5 rotational shifts with on-call support. A bachelor's degree in engineering (CS/IT) is required. The position offers a hybrid work schedule (3 days in office, 2 days remote) and benefits including parental leave, healthcare insurance, broadband reimbursement, and office amenities.
What you'll do
- Manage application deployments, improve deployment processes, and ensure consistent, reliable deployments across development and production environments
- Manage application configurations and the deployment server inventory
- Monitor the health and performance of applications post-release; respond to and troubleshoot incidents related to releases and operations
- Act as liaison between Release Manager, Operations, and Engineering stakeholders, communicate release status and issues, facilitate post-mortem reviews and retrospectives
- Create and maintain deployment procedures and related documentation
- Set up Linux servers, install applications, rebuild servers, and manage clusters
- Ensure platform support for cross-functional delivery of large-scale, complex projects involving multiple teams
- Maintain security measures and address vulnerabilities through upgrades and patches
- Work 24/5 rotational weekday shifts, including night shifts, and provide weekend on-call support
Requirements
- 1-5 years of experience in Release Operations, managing CI/CD and application deployments, configuration/change management, secrets management, and application support
- Good experience with Linux/Unix administration and automation using shell scripts, Python, or Ansible
- Experience with Docker and related tools (Kubernetes, Git, Grafana, Prometheus, etc.)
- Experience working with configuration management tools (Ansible, Chef, Puppet, Terraform, etc.)
- Experience with monitoring and alerting (Nagios, ELK, PagerDuty, etc.)
- Experience with basic networking and load balancing
- Experience designing, implementing, and testing disaster recovery plans for deployment failures
- Proven ability to work and communicate with a team of highly technical software/systems engineers
- Ability to work independently with minimal direction
- Attention to detail and problem-solving skills
- Experience working in both on-premises and cloud infrastructure (AWS, GCP)
- Experience with distributed systems and related technologies is a plus
- Strong analytical skills
- Excellent communication skills
- Team player with the ability to work with multicultural teams
- Proactive attitude toward identifying problems and providing solutions
- Creative and innovative thinking
- Apply AI-driven insights for release validation, risk detection, and post-release analysis
- Use GenAI/LLM tools to automate repetitive operations tasks, release workflows, and documentation
- Bachelor’s degree in engineering (CS / IT) or equivalent
Tech stack
Linux/Unix, Shell scripting, Python, Ansible, Docker, Kubernetes, Git, Grafana, Prometheus, Chef, Puppet, Terraform, Nagios, ELK, PagerDuty, AWS, GCP
Benefits
- Paternity/maternity leave
- Healthcare insurance
- Broadband reimbursement
- Kitchen with healthy snacks and drinks
- Catered lunches