Systems Design Engineer – AI Customer Systems Validation

Own end-to-end validation strategy for AI DCGPU systems from silicon to rack-level deployment.
Austin
Senior
Advanced Micro Devices

Designs high-performance CPUs, GPUs, and adaptive computing solutions for PCs, data centers, gaming, and embedded applications.

AMD Systems Design Engineer

What You Do At AMD Changes Everything

At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.

The Role

In the rapidly expanding world of AI and rack-level solutions, solid system-level integration and validation are paramount. Our DCGPU solutions, including GPUs and rack-level products, demand rigorous testing for optimum performance and reliability.

We are seeking a very senior Systems Design Engineer with extensive silicon-to-systems understanding to drive complex system-level validation, working closely with our critical customers to identify the unique elements of deploying our products. The Systems Design Engineer is responsible for delivering customer-specific validation alignment, execution and debug support. We foster and encourage continuous technical innovation to showcase successes as well as facilitate continuous career development.

The Person

The ideal candidate will possess systems design and validation engineering expertise that will be applied to validation planning, customer communication and root cause resolution. You are an expert system-level engineer with an understanding of hardware/firmware interaction, system-level test scenarios, and complex system-level debug methodologies. You will be part of a team that drives and improves AMD's ability to deliver the highest quality, industry-leading technologies to market, through direct customer work and by helping to drive internal validation activities.

Key Responsibilities

  • Develop a deep understanding of our silicon SoCs, board level designs, system and rack level designs and firmware/software/management stacks to drive the development of system stress tests and complex issue investigations
  • Work with our partners and customers to understand the unique elements of their integration and deployment of our AI products. Track co-validation activities, feature delivery alignment and overall program delivery success.
  • Help lead the debug and triage of issues found during validation cycles or in production environments.
  • Apply learnings from system debugs toward developing stronger test coverage and enabling strategies that accelerate issue debug and root cause analysis.
  • Work with multiple teams to develop and execute robust validation test plans at the functional, stress and volume levels that meet our customer workloads and requirements.
  • Contribute to technical innovation to improve AMD's capabilities across validation, including tool and script development, technical and procedural methodology enhancement, and various internal and cross-functional technical initiatives.

Preferred Experience

  • Programming/scripting skills (e.g. C/C++, Perl, Ruby, Python)
  • Extensive experience with board/platform-level debug, including delivery, sequencing, analysis, and optimization, with a structured approach to debug workflows
  • Extensive knowledge of system architecture, technical debug, and validation strategy
  • Strong analytical/problem-solving skills and pronounced attention to detail
  • Extensive exposure to system level integration and debug of SoC system level test scenarios including resets, RAS, system management, networking, workloads or performance
  • Must be a self-starter, and able to independently drive tasks to completion

Academic Credentials

Bachelor's or Master's degree in Computer Engineering or Electrical Engineering

Location

Austin, TX

This role is not eligible for visa sponsorship.
