Site Reliability Engineer (P3) - Cloud Infrastructure - Remote Eligible

Automate deployment and scaling of cloud infrastructure using Infrastructure as Code practices
Remote
Senior
6 days ago
Weave

A software platform offering integrated business communication tools, including phone, text, and email, for small and medium-sized businesses.

Cloud Infrastructure Engineer

In this role, you will be responsible for building, maintaining, and improving the cloud infrastructure that powers Weave's services. You will work with a modern tech stack, including Google Cloud Platform (GCP), Go, Kubernetes, Terraform, Prometheus, Grafana, and Vault. As an engineer, you will be proficient in core tools and languages, capable of completing routine tasks independently, and will use established patterns to create high-quality, maintainable solutions. You will play a key role in ensuring the reliability, scalability, and performance of our platform.

This position is remote.

Reports to: Josh Keife

What You Will Own

  • Automate away as much of the day-to-day work as possible
  • Design and implement highly available and scalable systems
  • Ensure smooth day-to-day operations of Weave's infrastructure
  • Build and evolve tools and standards for automation, scaling, monitoring, and alerting
  • Collaborate with product teams to resolve production issues, improve monitoring, and leverage cloud services
  • Participate in a weekly on-call rotation

What You Will Need to Accomplish the Job

  • Proficiency with at least one cloud platform; experience with GCP services (e.g., GKE, Compute Engine, VPC) is a plus
  • Solid understanding of container technologies such as Docker and Kubernetes
  • Experience with automation tools such as Puppet, Salt, Ansible, or Terraform
  • Experience writing automation in languages such as Go or Python
  • Experience designing highly available and scalable systems
  • Proficiency with version control systems (e.g., Git) and CI/CD concepts
  • Strong problem-solving skills and the ability to troubleshoot complex issues systematically

What Will Make Us Love You

  • A passion for Infrastructure as Code and a habit of seeking opportunities to automate, optimize, and manage infrastructure through code
  • Deep expertise in Kubernetes, including cluster design, deployment, and ongoing management for large-scale applications
  • Experience with advanced GCP services and architectures
  • A strong sense of ownership and accountability for the systems you build and maintain
  • Experience managing infrastructure and applications using IaC, GitOps, and ArgoCD

Weave is an equal opportunity employer that is committed to fostering an inclusive workplace where all individuals are valued and supported. We welcome anyone who is hungry to learn, problem-solve, and progress, regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other applicable legally protected characteristics. If you have a disability or special need that requires accommodation, please let us know.

All official correspondence will occur through Weave-branded email. We will never ask you to share bank account information, cash a check from us, or purchase software or equipment as part of your interview or hiring process.
