View All Jobs 131498

Site Reliability Engineer - Remote Eligible

Design proactive observability systems to prevent platform outages and improve reliability
Remote
Senior
yesterday
Virtasant

Virtasant

A global cloud services provider offering solutions for cloud migration, management, and optimization with a focus on cost efficiency.

Site Reliability Engineer

We're looking for a Site Reliability Engineer to join a high-impact cloud infrastructure team at one of Virtasant's key technology partners. You'll play a critical role in improving system observability, ensuring platform reliability, and embedding proactive engineering practices across a globally distributed environment.

This is a hands-on technical role ideal for someone who brings a developer's mindset to SRE work. You'll be expected to anticipate problems before they happen, automate smart solutions, and elevate how reliability is measured, built, and maintained.

What You'll Do

  • Drive the creation and evolution of observability systems — including dashboards, logging, alerting, and instrumentation.
  • Identify trends, anomalies, and early warning signs through data analysis.
  • Work with engineers to drive the adoption of observability best practices across squads.
  • Surface, propose, and implement proactive reliability improvements across AWS environments.
  • Contribute to build, test, and deploy workflows (CI/CD), with a strong emphasis on automation.
  • Collaborate across teams using agile ceremonies, async-first workflows, and direct feedback loops.

What We're Looking For

Must-Have Experience

  • Deep knowledge of observability tooling, preferably with Datadog
  • Hands-on SRE experience within AWS, including Lambda, containers, and IAM
  • Strong programming skills in Python and Ruby
  • Experience with Terraform and infrastructure as code (IaC) practices
  • Familiarity with incident management, on-call rotations, and SLAs
  • Ability to identify patterns and risks from telemetry and act on them proactively

Nice-to-Haves

  • Previous experience as a software developer or DevOps engineer
  • Knowledge of reliability strategies for containerized workloads
  • Comfortable contributing to CI pipelines and deployment strategies
  • Experience working in environments with limited QA/BA handoffs

Tools & Environment

  • Languages: Python, Ruby
  • Cloud: AWS (Lambda, ECS, IAM)
  • IaC: Terraform
  • Observability: Datadog
  • Workflow: Agile (Scrum), Jira, Git, CI/CD pipelines

Why This Role is Exciting

  • Observability-first culture – You won't just respond to alerts; you'll design the systems that prevent them.
  • Hands-on impact – You'll drive real improvements that increase uptime, performance, and engineering confidence.
  • Autonomy & ownership – Work independently while contributing to high-performing global teams.
  • Real scale challenges – Help support large-scale, distributed systems with meaningful end-user impact.

Why Virtasant

Virtasant is a global cloud consulting and technology company operating across 130+ countries. We deliver transformative solutions across cloud cost optimization, software engineering, technology operations, and AI. Our projects are meaningful, our teams are globally distributed, and our culture is built on autonomy, trust, and technical excellence.

+ Show Original Job Post
























Site Reliability Engineer - Remote Eligible
Remote
Engineering
About Virtasant
A global cloud services provider offering solutions for cloud migration, management, and optimization with a focus on cost efficiency.