View All Jobs 138652

Sr. Power Engineer, Annapurna Labs, Machine Learning Hardware

Lead the design of power delivery systems for ML rack-scale servers at AWS Annapurna Labs
Cupertino, California, United States
Senior
yesterday

Hardware Design Engineer

AWS Utility Computing provides product innovations—from foundational services such as Amazon's Simple Storage Service (S3) and Amazon Elastic Compute Cloud (EC2), to consistently released new product innovations that continue to set AWS's services and features apart in the industry. As a member of the UC organization, you'll support the development and management of compute, database, storage, internet of things (iot), platform, and productivity apps services in AWS, including support for customers who require specialized security solutions for their cloud services. Annapurna Labs (our organization within AWS UC) designs silicon and software that accelerates innovation. Customers choose us to create cloud solutions that solve challenges that were unimaginable a short time ago—even yesterday. Our custom chips, accelerators, and software stacks enable us to take on technical challenges that have never been seen before, and deliver results that help our customers change the world. We are looking for a hardware design engineer with strong skills in both hardware and software. In this role you will be responsible for architecting, designing, simulating and validating power delivery multi-source solutions on machine learning products and support those designs in AWS fleet through their entire life cycle. You will work cross functionally with AWS monitoring teams, members of the hardware design team, supply chain team, vendors, ODMs and additional teams across AWS to improve quality and reliability of products operating in the fleet.

Key job responsibilities in this position include:

  • Drive improvements in availability, and efficiency as they relate to power for AWS servers. These improvements should include enhanced designs, enhanced testing, and improved manufacturing at the suppliers themselves.
  • Identify and drive process improvements and automation in power design, validation, manufacturing and both pre-production and field testability.
  • Develop comprehensive AWS reliability models for server power, taking advantage of our copious production telemetry and deep knowledge of component internals.
  • Serve as power technical lead on some of our most demanding machine learning rack scale projects, including the launch of new services and features that will be deployed and used at massive scale.
  • Work with technology leaders to define and drive critical features and architectural advancements into our rack scale systems.
  • Help drive a power technology roadmap and guide our suppliers to meet Amazon's server power needs for future generation products.
  • Collaborate with other principal engineers to find simple solutions to brain-contorting problems.
  • Ensure the quality & reliability of architecture of our end-to-end system design.
  • Collaborate with our supply chain team to define the sourcing strategy for power commodities.
  • Engage deeply in the operations of our systems, including identifying and rectifying root causes of critical issues.
  • Assist in the career development of others, actively mentoring individuals and the community on advanced technical issues, and helping managers guide the career growth of their team members.
  • Provide technical guidance to multiple teams, increasing their productivity and effectiveness by sharing your deep knowledge and experience.

About the team: Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.

Diverse Experiences: AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn't followed a traditional path, or includes alternative experiences, don't let it stop you from applying.

Basic qualifications include:

  • B.S. electrical engineering, computer engineering or a related field
  • 8+ years of experience with hardware and software integration for embedded systems or hardware development
  • 5+ years of experience with managing ODM
  • 5+ years of experience as the lead technologist/architect/engineer in the power space for a major technology, component, product, or product line
  • Demonstrated expertise in driving power reliability and efficiency through design enhancement and failure mode analysis
  • Experience with statistical analysis and modeling for reliability, performance, and/or cost

Preferred qualifications include:

  • M.S. computer science or a related field
  • Experience with ML accelerators
  • Experience with data center deployments of compute-related infrastructure

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

+ Show Original Job Post
























Sr. Power Engineer, Annapurna Labs, Machine Learning Hardware
Cupertino, California, United States
Engineering
About California Staffing
California Staffing Agency