NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.
We are currently seeking an OpenShift Engineer to join our team in Bangalore, Karnātaka (IN-KA), India (IN).
Educational Qualification: Must be a graduate (B. Tech/B.E./MCA or equivalent).
Experience: 8 to 10 years of experience in Systems Engineering, with a significant focus on Linux Internals and Enterprise OpenShift (OCP 4.x) administration.
Advanced Linux Engineering & Internals:
• Expert Linux Administration: Deep expertise in RHEL/CoreOS internals, including kernel tuning, memory management (OOM killer optimization), and systemd.
• Performance Engineering: Advanced skills in system-level troubleshooting (strace, tcpdump, sar) and capacity planning for large-scale enterprise workloads.
• Hardening & Compliance: Architecting security baselines using CIS/STIG benchmarks and managing vulnerability remediation via Red Hat Satellite or Ansible.
OpenShift & Cloud Architecture:
• Cluster Design: Designing high-availability, multi-zone OpenShift clusters (IPI/UPI) and managing Control Plane health (ETCD tuning, API server optimization).
• Managed Cloud Services: Deep hands-on experience with ROSA (AWS) and ARO (Azure), including PrivateLink, STS integration, and IAM federation.
• Infrastructure Operators: Expert knowledge of the Machine Config Operator (MCO), Operator Lifecycle Manager (OLM), and custom resource definitions (CRDs).
Automation, GitOps & IaC:
• Automation Frameworks: Advanced proficiency in Python and Bash for building custom infrastructure tooling and automation.
• GitOps Mastery: Implementation of ArgoCD for "Cluster as Code" to maintain consistency across dev, test, and production environments.
• IaC Leadership: Extensive experience using Terraform and Ansible to automate multi-cluster provisioning and Day-2 operations.
Monitoring, SRE & Observability:
• Observability Stack: Expert knowledge of Prometheus, Grafana, and Thanos for multi-cluster monitoring and performance visualization.
• Log Analytics: Implementing and scaling the EFK/Loki stack for centralized enterprise logging.
• SRE Mindset: Defining SLIs/SLOs, managing error budgets, and implementing proactive self-healing for platform reliability.
Leadership & Incident Management:
• Strategic Troubleshooting: Act as a Tier-3 escalation point for major incidents (P1/P2); lead deep-dive Root Cause Analysis (RCA).
• Mentorship: Technical leadership and mentoring of L1/L2 engineering teams to elevate overall infrastructure standards.
• Stakeholder Management: Collaborating with DevOps and Application teams to align platform architecture with business goals.
Certifications (Highly Desirable): RHCA (Red Hat Certified Architect) or RHCE (Certified Engineer). EX380 (Red Hat Certified Specialist in OpenShift Automation and Management). CKA (Certified Kubernetes Administrator) or CKS (Certified Kubernetes Security Specialist).
Years of Experience: 8 to 10 Years. Work Timings: Willing to work in rotational shifts / on-call support in a 24x7 environment.
About NTT DATA:
NTT DATA is a $30 billion business and technology services leader, serving 75% of the Fortune Global 100. We are committed to accelerating client success and positively impacting society through responsible innovation. We are one of the world's leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale AI, cloud, security, connectivity, data centers and application services. Our consulting and industry solutions help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have experts in more than 50 countries. We also offer clients access to a robust ecosystem of innovation centers as well as established and start-up partners. NTT DATA is a part of NTT Group, which invests over $3 billion each year in R&D.