View All Jobs 139661

Site Reliability Engineer

Maintain and improve Azure cloud system reliability through proactive monitoring and troubleshooting.
Guadalajara, Jalisco, Mexico
Mid-Level
3 weeks ago
NTT DATA

NTT DATA

Provides global IT services and consulting, specializing in digital transformation, application development, infrastructure, and business process outsourcing.

Site Reliability Engineer

We are currently seeking a Site Reliability Engineer to join our team in GDL, Jalisco (MX-JAL), Mexico (MX). Perform L1.5 activities such as monitoring, deployment, rollback. Monitor the efficiency of the Azure cloud systems to prevent outages and initiate an Incident Management bridge in case of an outage. Troubleshoot Azure resources, escalate to Level 3 (Software Development Team). Understand the Microsoft Azure Cloud - ideally Azure Fundamentals certified OR Computer Science/Information Systems Management degree. Familiar with PaaS and IaaS - VMs, Storage, EventHub, Service Fabric Cluster (SFC), Azure Kubernetes Service (AKS), CosmosDB, SQL Server, IoT Hub, Databricks, KeyVault, Datalake. Understand the concept of Internet of Things (IoT) - telemetry, ingestion, processing, data storage, reporting. Understand the concept tools - Octopus, Bamboo, Terraform, Azure DevOps, Jenkins, Github, Ansible. Understand the concept of container orchestration platforms (e.g. Kubernetes). Understand the concept of scripts: Powershell, Python. Understand the difference between NoSQL and SQL databases, and how to maintain them. Understand monitoring and logging systems (LogAnalytics, Splunk, ELK, Prometheus, Nagios, Zabbix, etc.). Independent thinker - why does it break, what can I proactively do to fix it.

+ Show Original Job Post
























Site Reliability Engineer
Guadalajara, Jalisco, Mexico
Engineering
About NTT DATA
Provides global IT services and consulting, specializing in digital transformation, application development, infrastructure, and business process outsourcing.