DevOps Engineer
Quest Global delivers world-class end-to-end engineering solutions by leveraging our deep industry knowledge and digital expertise. By bringing together technologies and industries, alongside the contributions of diverse individuals and their areas of expertise, we are able to solve problems better, faster. This multi-dimensional approach enables us to solve the most critical and large-scale challenges across the aerospace & defense, automotive, energy, hi-tech, healthcare, medical devices, rail and semiconductor industries.
We are looking for humble geniuses, who believe that engineering has the potential to make the impossible possible; innovators, who are not only inspired by technology and innovation, but also perpetually driven to design, develop, and test as a trusted partner for Fortune 500 customers. As a team of remarkably diverse engineers, we recognize that what we are really engineering is a brighter future for us all. If you want to contribute to meaningful work and be part of an organization that truly believes when you win, we all win, and when you fail, we all learn, then we're eager to hear from you. The achievers and courageous challenge-crushers we seek, have the following characteristics and skills.
DevOps Engineer will be responsible to build, manage, and automate our Workforce Experience Platform infrastructure. You will be working with engineering teams and focusing on AWS infrastructure and automation. You will also contribute to track and enhance system resilience and overall reliability of the system.
Technical Capabilities:
- Automating Tasks: Designing, maintenance and management of tools for automation of different operational processes. Design and Write code to automate repetitive tasks, such as provisioning new servers or managing configurations.
- Troubleshooting Outages: When incidents occur, dive into troubleshooting, identifying root causes, and resolving issues promptly.
- On-Call Responsibilities: Participate in on-call rotations, ensuring 24/7 availability and rapid response to incidents.
- Monitoring and Observability: They set up monitoring systems, track key metrics, and respond proactively to anomalies.
- Capacity Planning: Analyze system capacity, predict resource needs, and optimize infrastructure.
- Deployment and Release Management: Deployment, automation, management, configuration and maintenance of AWS cloud-based production system.
Process Capabilities:
- Change Management: Oversee how code is deployed, configured, and monitored.
- Availability and Latency: Focus on maintaining high availability and low latency for services.
- Emergency Response: Incident management, ensuring timely resolution and minimal impact.
- Capacity Management: Assess system capacity, scaling resources as needed.
- Documentation: Document processes, best practices, and incident resolutions.
- Collaboration: They work closely with development teams, fostering collaboration and shared responsibility.
What You Will Bring:
- Experience as Devops for large cloud native applications
- Effective communication and collaboration with cross-functional teams.
- Manage your own time and work well both independently and as part of a team
- Good understanding of Agile processes
- Experience in code development in at least one high-level programming language.
- Familiarity with Operating Systems and networking.
- Extensive experience on AWS platform - EC2, EKS, S3, RDS, IAM, CloudFront, CloudWatch, SNS/SQS, Kubernetes, ElastiCache, Lambda, AWS IOT, Kinesis. Experience with multi-tier architectures: load balancers, caching, web servers, application servers, databases, and networking.
- Knowledge of monitoring and observability tools: Grafana and Splunk.
- Comfortable with infrastructure-as-code (e.g., Terraform, Ansible).
- Strong analytical skills to troubleshoot complex issues.
- CI/CD, pods management, Sonar Q, Git, etc
- Experience on enterprise level device manageability and SaaS solutions.
- Some experience on Gen-AI and other AI related capabilities to enhance monitoring tools.
Pay Range: $110,000-$130,000/Annum
Compensation decisions are made based on factors including experience, skills, education, and other job-related factors, in accordance with our internal pay structure. We also offer a comprehensive benefits package, including health insurance, paid time off, and retirement plan.
Work Requirements:
- This role is considered an on-site position located in Houston, TX.
- You must be able to commute to and from the location with your own transportation arrangements to meet the required working hours.