✨ About The Role
- The Site Reliability Engineer will be responsible for designing and building systems, tooling, and processes for a scalable and observable platform.
- The role involves empowering development teams to manage their full application stack, optimizing development velocity while ensuring reliability.
- The first 30 days will focus on understanding the current cloud build and deployment processes and the architecture of the Flock system.
- The engineer will participate in a full sprint cycle with the SRE team to plan work effectively.
- By the 60-day mark, the engineer should be able to perform tasks with decreased guidance and assist in resolving help-desk requests.
- The role requires regular participation in meaningful technical discussions and peer reviews.
- The expectation is to achieve a deep understanding of various infrastructure and tooling components within 90 days.
- The position is results-oriented, emphasizing the importance of good days leading to good weeks and months.
âš¡ Requirements
- The ideal candidate will have a minimum of 2 years of experience in an engineering or Site Reliability Engineering (SRE) role.
- A solid understanding of system design, deployment, and maintenance is essential for success in this position.
- Experience with Infrastructure as Code (IaC) tools and practices is crucial for streamlining infrastructure management.
- Proficiency in at least one major cloud provider platform, such as AWS, GCP, or Azure, is required.
- Familiarity with Continuous Integration and Continuous Delivery (CI/CD) pipelines and tools is important for automating the software development lifecycle.
- The candidate should have practical development experience with programming languages like Golang, Python, or TypeScript/Node.js.
- A proactive attitude and a willingness to learn and adapt in a fast-paced environment will be beneficial.
- The candidate should be comfortable participating in technical discussions and peer reviews.