Site Reliability Engineer

Implement comprehensive observability and alerting systems for critical cloud-native infrastructure
Noida, Uttar Pradesh, India
Senior
21 hours ago
NTT DATA

Provides global IT services and consulting, specializing in digital transformation, application development, infrastructure, and business process outsourcing.

NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.

We are currently seeking a Site Reliability Engineer to join our team in Noida, Uttar Pradesh (IN-UP), India (IN).

Role Overview

We are seeking an experienced Site Reliability Engineer (SRE) with 5–8 years of experience ensuring the reliability, scalability, and performance of critical systems. The ideal candidate must have strong hands-on experience in observability, monitoring, alerting, Splunk, and telemetry, along with a solid understanding of cloud-native infrastructure and automation.

Key Responsibilities

  • Implement and maintain observability across metrics, logs, traces, and events.
  • Build and optimize monitoring dashboards and service health indicators using Splunk or similar tools.
  • Configure, fine-tune, and maintain proactive alerts with a high signal-to-noise ratio.
  • Lead incident response, conduct root cause analysis (RCA), and drive long-term corrective measures.
  • Define, measure, and enhance SLIs, SLOs, reliability KPIs, and error budgets.
  • Improve system performance, scalability, and availability across environments.
  • Automate monitoring, alerting, and operational workflows to reduce manual toil.
  • Standardize and maintain telemetry instrumentation across services.
  • Own and optimize logging pipelines, ingestion, parsing, indexing, and retention.
  • Collaborate with engineering teams to integrate reliability best practices into application development.
  • Participate in on-call rotations and ensure timely incident resolution.
  • Partner with cloud/platform teams to enhance deployment readiness and operational stability.

Required Skills & Experience

  • 5–8 years of experience in SRE, DevOps, or system reliability roles.
  • Strong hands-on experience with Splunk (queries, dashboards, alerts, ingestion).
  • Solid understanding of observability tools (Splunk, Prometheus, Grafana, Datadog, OpenTelemetry, etc.).
  • Strong knowledge of Linux, networking fundamentals, and distributed systems.
  • Experience with cloud platforms (AWS / Azure / GCP) and container technologies (Docker, Kubernetes).
  • Proficiency in scripting (Python, Shell, or similar).
  • Experience with production on-call environments and incident management.
  • Familiarity with SLIs/SLOs, capacity planning, and reliability engineering concepts.
  • Experience with OpenTelemetry-based instrumentation.
  • Exposure to APM tools (Dynatrace, AppDynamics, New Relic).
  • Knowledge of IaC tools like Terraform or Ansible.
  • Understanding of microservices architecture and CI/CD pipelines.

NTT DATA is a $30 billion business and technology services leader, serving 75% of the Fortune Global 100. We are committed to accelerating client success and positively impacting society through responsible innovation. We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale AI, cloud, security, connectivity, data centers and application services. Our consulting and industry solutions help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have experts in more than 50 countries. We also offer clients access to a robust ecosystem of innovation centers as well as established and start-up partners. NTT DATA is a part of NTT Group, which invests over $3 billion each year in R&D.
