Site Reliability Engineering (SRE) is what you get when you treat operations as a software engineering problem. Our mission is to improve the availability, latency, performance, and security of the Microsoft Teams services. Like traditional operations, we keep revenue-critical systems up and running, even when natural disasters, bandwidth outages, and configuration problems occur. Unlike traditional operations groups, we identify and address these problems directly through software improvements, innovative technologies, and systems automation.
As a Site Reliability Engineer II in Teams, you will provide leadership, direction, and accountability for the networking, infrastructure design, end-to-end implementation, and security of Teams services. Strong collaboration skills are required: you will work closely with other engineering teams to ensure that services and systems are highly stable and performant and that they meet the expectations of internal stakeholders as well as external customers and users. This opportunity will allow you to learn what it takes to deploy and run software as a 24x7 enterprise-grade cloud service, hone your security expertise, and become an expert in web services optimization.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.