✨ About The Role
- Responsible for maintaining reliable, secure, scalable, and highly available infrastructure and applications that empower over 70,000 Service Professionals to run their businesses
- Collaborating with Product and Engineering peers to support an infrastructure platform that is reliable, scalable, secure, and reduces manual toil
- Driving operational excellence while scaling GlossGenius's AWS cloud footprint and fostering close collaboration with product and engineering teams
- Building tools to help engineers quickly identify problems, driving incident management practices, and improving monitoring and alerting platforms
- Spreading SRE culture throughout GlossGenius, understanding industry and company-wide trends to assess and develop new technologies
⚡ Requirements
- At least 4 years of experience working with cloud technologies in a Production Engineer, Cloud Engineer, Site Reliability Engineer, or equivalent DevOps role
- Proficient in infrastructure-as-code principles, IP networking, DNS, CDN, load balancing, HTTP, firewalls, and cloud-first monitoring, logging, and alerting infrastructure
- Skilled in container technology using Docker and Kubernetes, with the ability to write high-quality code in a high-level programming language
- Outcome-oriented, capable of executing projects from start to finish and of participating in on-call rotations
- Strong problem-solving skills, able to manage complexity, engage with stakeholders, and approach situations with a bias to action