What You Do At AMD Changes Everything At AMD, our mission is to build great products that accelerate next-generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you'll discover the real differentiator is our culture. We push the limits of innovation to solve the world's most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career.
As a member of the AMD Server Design Health Team, you will use your software engineering skills and knowledge of CPU hardware design to create, enhance, and maintain tools and system level tests to stress AMD's parts for both functional and operation quality. You will be responsible to drive systematic improvements in our stress and feature coverage tools based on your findings and expertise. You will work as part of a broader team to help improve the time-to-market, quality, and reliability of AMD's Servers.
You approach challenges with persistence, creativity, and thoughtful problem-solving. You work as part of a team with strong communication and collaboration skills. You have a solid understanding of both hardware and software, strong problem-solving skills, and drive to understand and learn. You enjoy hands-on work and like to take the initiative to build high-quality tools to test and stress AMD's designs. Does this describe you? If so, then join us!
CPU architecture, X86 (or similar) instruction set architecture, and/or design knowledge. Developing system level tests that can make devices under test consume their full power and run at high frequencies. Developing system level tests that are self-checking and can catch issues with underlying hardware. Software development experience programming in C++ and assembly in a Linux based environment. Silicon chip bring-up, validation and debug.
Understanding of the various aspects of a CPU product definition such as frequency, voltage, thermal design power, performance, etc. Familiarity with microprocessor Design-for-Test (DFT) and Design-for-Debug (DFD) logic, use, and issues. Experience in clocking, reset, power-up sequences and power management. Understanding of typical silicon debug features, infrastructure, and techniques. System-level understanding of CPU/SoC architecture, DRAM/memory, PCIE and boards.
In-depth understanding of server architecture and hardware components, especially AMD's SOCs. Good understanding of various IP blocks in an SOC. Good understanding of various Operating Systems, Strong knowledge of x86 architecture and server technologies. Proficiency in programming languages such as Python and scripting languages like BASH and PowerShell for tool development and automation. Proficient in database management and code version control. Experience in creating, enhancing, and maintaining diagnostic tools for server stress testing. Proficiency in testing methodologies, tools, and strategies. Strong interpersonal and communication skills to collaborate with cross-functional teams, external vendors, and users. Knowledge of data analysis tools and techniques. Good technical writing and communication skills for creating documentation and training materials. Project management skills to plan, prioritize, and execute tasks effectively, especially when working on multiple projects. Proficiency in problem-solving, root cause analysis, and troubleshooting server-related issues.
BS, or MS degree in Electrical, Computer Engineering, Computer Science, preferred
Austin, TX