View All Jobs 169689

Site Reliability Engineer

Develop and deploy reliable firmware updates for Azure GPU hardware infrastructure
Redmond, Washington, United States
Senior
yesterday
Microsoft

Microsoft

A global technology leader known for its software products, cloud services, and hardware like Windows OS and Xbox consoles.

Site Reliability Engineer

The Firmware Deployment team within Microsoft’s Silicon Cloud Hardware Infrastructure Engineering (SCHI) organization is responsible for building and operating world-class software and data-driven services that support Azure’s hardware infrastructure development. Our mission is to enable safe, reliable, and intelligent deployment of firmware payloads across the Azure fleet, ensuring system health and operational quality at scale.

We are seeking a Site Reliability Engineer within the Firmware Deployment team, you will be instrumental in shaping the future of the Azure Fleet. Your primary responsibility will involve developing and applying stable firmware releases across the GPU fleet, as well as potentially supporting other related environments. This work is essential to maintain Microsoft’s security and performance standards while delivering an outstanding experience for our customers.

Your efforts in deploying and managing firmware updates will ensure the reliability and efficiency of Azure’s hardware infrastructure. By focusing on stability and operational excellence, you will help safeguard system health and contribute to the ongoing success and growth of Azure’s global infrastructure.

+ Show Original Job Post
























Site Reliability Engineer
Redmond, Washington, United States
Engineering
About Microsoft
A global technology leader known for its software products, cloud services, and hardware like Windows OS and Xbox consoles.