Palo Alto Networks runs a large hybrid infrastructure and is one of the largest GCP customers. As a Site Reliability Engineer, you will be part of a team supporting the services running on this infrastructure. This includes automation, architecture, performance, metrics, troubleshooting, security, and reliability.
Our stack includes Kubernetes, Docker, GCP, AWS, Ansible, Terraform, Vault, Gitlab, Spinnaker, Pub/sub, Bigtable, Memorystore, Bigquery, RabbitMq, Kafka, MySQL, Python, and Go. We don't expect you to know all these, but we do expect you to learn the ones needed for this role.
Your impact includes contributing to the success of SRE and DevOps, developing expertise in new technologies, working with developers, researchers, data scientists, and security experts, and designing, building, and operating reliable, secure Cloud infrastructure.
Your experience should include a BS or MS in Computer Science, a related field, or equivalent professional experience or equivalent military experience, expertise in configuration management with a framework such as Ansible, Terraform, Helm, Kubernetes, proficiency in Python and/or Go, and experience in Production Engineering, DevOps, or Site Reliability.