AdTechTalent
Other | 2 months ago | Hybrid

Epsilon

Information Technology Analyst

Grafana, ELK, Elastic Stack, Linux, Windows Server, Kubernetes, IT operations, incident management, monitoring, ServiceNow, ITIL, NOC, OCC

Key details

Salary

Not specified

Employment type

Full-time

Seniority

Mid-level

Years experience

3-5

Location

Bengaluru, India

Full job description

The role involves monitoring enterprise IT infrastructure dashboards using Grafana and ELK, validating alerts, reducing noise, and classifying incidents by severity. Responsibilities include L1 operational monitoring and triage for Kubernetes clusters, Linux and Windows administration tasks such as patching and troubleshooting, incident management using ServiceNow, and acting as first responder for major incidents. The position requires 3-5 years of experience in IT operations or infrastructure support, hands-on skills in Linux/Unix and Windows Server administration, familiarity with monitoring tools like Grafana and ELK, and understanding of ITIL incident management processes. The role operates in a 24x7 shift-based environment and offers opportunities to develop skills in modern observability platforms and incident command.

What you'll do

  • Monitor enterprise dashboards using Grafana & ELK for logs, metrics, and alerts
  • Validate alerts, reduce noise, and classify incidents by severity (P1–P3)
  • Provide L1 operational monitoring and triage for Kubernetes clusters
  • Perform Linux and Windows administration tasks, including patching, health checks, and troubleshooting
  • Open, update, and manage incidents in ServiceNow with accurate diagnostics
  • Act as first responder for major incidents and support incident bridges and escalations
  • Maintain shift logs, handover notes, SOPs, and operational documentation

Requirements

  • Bachelor’s degree in Engineering, Computer Science, IT, or an equivalent discipline
  • 3-5 years of experience in IT operations, NOC, OCC, or infrastructure support roles
  • Hands-on exposure to Linux / Unix administration
  • Hands-on exposure to Windows Server administration and patching
  • Experience with monitoring tools such as Grafana and ELK
  • Understanding of ITIL-aligned incident management processes
  • Willingness to work in 24x7 shift-based operations

Tech stack

Grafana, ELK, Elastic Stack, OpsRamp, SolarWinds, PagerDuty, ServiceNow, Linux, Windows Server, Kubernetes

Benefits

  • Exposure to modern observability platforms such as Grafana and ELK
  • Hands-on experience with major incident management and real-time incident command
  • Opportunities to build strong foundational skills in Linux, Windows, and Kubernetes operations
  • Fast-paced environment that builds decision-making, communication, and operational rigour
  • People centricity and supportive work environment
  • Collaboration and collective goal achievement
  • Opportunities for growth through learning, development and career advancement
  • Innovation through cutting-edge solutions
  • Work-life balance and flexibility
  • Equal opportunity employer promoting diversity and inclusion
