Principal Consultant Infrastructure & DevOps
We're a new generation software company based in Hyderabad, helping to scale digital businesses to disrupt global utility retail markets. We provide technology development, customer experience and process optimisation services to support our award-winning utility retailers in New Zealand and Australia.
It's an exciting time where traditional utilities need to innovate. Consumers expect companies to do good for their employees, customers, local communities, and for the future of the planet (all while offering seamless user experience that's great value). Our strategy recognises that the exceptional technology we create makes us one of the best consumer facing businesses in our industry.
Purpose of the Job
The Infrastructure & DevOps Lead will be responsible for building, implementing, and managing scalable infrastructure solutions across hybrid and cloud environments. With a strong focus on automation, CI/CD, and container orchestration, the role ensures high availability, security, and performance of applications. The candidate will also lead a team, manage cross-functional collaboration, and drive innovation by leveraging AI-driven approaches for observability, incident response, and automation.
Key Responsibilities
- Infrastructure & Cloud Management
- Configure, deploy, and manage infrastructure on AWS (EC2, S3, ECS, EKS, RDS, IAM, AWS Organizations, SCPs).
- Ensure secure, compliant, and cost-optimized cloud operations.
- Manage and optimize Kubernetes clusters and containerized workloads.
- Automation & DevOps
- Develop and maintain Infrastructure as Code using Terraform.
- Build, manage, and optimize CI/CD pipelines for faster and reliable deployments.
- Implement monitoring, observability, and AI-driven insights for proactive issue detection.
- Operations & Incident Management
- Oversee incident, problem, and change management processes.
- Lead root cause analysis, performance tuning, and incident post-mortems.
- Handle rosters and ensure 24x7 operational coverage with the team.
- Utilize APM tools (Experience with Grafana, Prometheus etc. AppDynamics, New Relic a plus) for monitoring and troubleshooting.
- Leadership & Collaboration
- Manage and mentor a team of DevOps engineers, fostering a culture of learning and ownership.
- Collaborate with application, QA, product, and security teams to ensure smooth delivery.
- Partner with leadership to define strategy and adopt AI-enabled automation for infra and DevOps.
- Innovation & AI Integration
- Explore and integrate AI/ML solutions for predictive scaling, automated incident detection, and intelligent remediation.
- Drive adoption of AI copilots/assistants for developers and operations.
Desired Skills & Experience
- Experience: 15+ years overall, with 5+ years in Infrastructure & DevOps.
- Cloud Expertise: Deep hands-on experience with AWS (EC2, S3, ECS, EKS, RDS, IAM, Organizations, SCPs).
- Automation & IaC: Strong in Terraform, Ansible, Helm, Kubernetes.
- CI/CD: Proficiency in building pipelines using Jenkins, GitLab CI, GitHub Actions, or Azure DevOps.
- Observability & APM: Knowledge of (Grafana, Prometheus, Open Telemetry or similar).
- Incident & Operations: Proven track record in incident management, on-call roster handling, and root cause analysis.
- Leadership: Demonstrated people management, mentoring, and team-building experience.
- Cross-team Communication: Strong ability to collaborate with developers, testers, product owners, and leadership teams.
- AI Awareness: Familiarity with AI-driven DevOps trends such as AIOps, self-healing systems, predictive monitoring, and workflow automation.
- Soft Skills: Excellent communication, problem-solving mindset, and ability to influence stakeholders.
Nice-to-Have Skills:
- Certifications: AWS Certified DevOps Engineer, Kubernetes CKAD/CKA, Terraform Associate.
Want to help us make it better? Apply and we'll be in touch.