View All Jobs 170134

Principal Software Engineer - Remote Eligible

Develop GPU virtualization solutions to support large-scale AI workloads in Azure virtual machines
Remote
Senior
yesterday
Microsoft

Microsoft

A global technology leader known for its software products, cloud services, and hardware like Windows OS and Xbox consoles.

Azure HPC/AI Team Opportunity

Azure High Performance Computing and AI Platform (HPC/AI) group is the team behind Azure’s cloud offering that powers some of the most demanding and largest scale AI training and inference workloads in the industry. The virtual machine (VM) series that our team owns combine cutting edge GPUs and accelerators, as well as a state-of-the-art scale-out network infrastructure to enable these workloads. We collaborate with many Microsoft teams and our industry partners to design and bring up the underlying platform, and we build the software to expose this platform as an Azure service.

As a Principal Software Engineer in the Azure HPC/AI team, you will play a critical role in delivering the next generations of our platform by solving technical problems at all levels of the stack, contributing to our codebases to enable new features, working on architectural proposals, and collaborating with our internal and industry partners.

This position involves deep technical work that primarily focuses on HW/SW interactions, device virtualization, and performance analysis of GPU workloads in VMs. Since our team is also responsible for vertical integration of our services, you will also have the opportunity to work with upper layers of the Azure infrastructure.

We are looking for someone who is passionate about quality, wants the customer to succeed and get things done. You will join a phenomenal team of hardworking engineers with deep experience with replication systems, highly available systems, large scale algorithms, dynamic and high-performance solutions at massive scale.

It is an exciting time for the team as we are working on expanding the capacity and range of supported scenarios to support the next 100X growth. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

+ Show Original Job Post
























Principal Software Engineer - Remote Eligible
Remote
Engineering
About Microsoft
A global technology leader known for its software products, cloud services, and hardware like Windows OS and Xbox consoles.