View All Jobs 159078

Compute Architecture Software Engineer

Develop GPU-accelerated software solutions to enhance large language model inference performance
Shanghai
Senior
yesterday
NVIDIA

NVIDIA

A leading designer of graphics processing units (GPUs) for gaming and professional markets, as well as system on a chip units (SoCs) for the mobile computing and automotive market.

LLM Inference Software Engineer

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It's a unique legacy of innovation that's fueled by great technology—and amazing people.

Today, we're tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what's never been done before takes vision, innovation, and the world's best talent. As an NVIDIAN, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

Join NVIDIA, a leader in advancing computer graphics, PC gaming, and accelerated computing for over 25 years. As an LLM Inference Software Engineer, you will be at the forefront of innovative AI technology, working on the ground-breaking TRTLLM project. This role offers you the exceptional opportunity to accelerate LLM inference using GPU technology, influencing everything from single PCs to clusters with thousands of powerful GPUs. Be part of a team that values creativity, cooperation, and the pursuit of excellence.

What you'll be doing:

  • You will develop and optimize software solutions to accelerate LLM inference using GPU technology.
  • Collaborate closely with a world-class team of engineers to implement and refine GPU-based algorithms.
  • Analyze and determine the most effective methods to improve performance, ensuring seamless execution across diverse computing environments.
  • Engage in both individual and team projects, contributing to NVIDIA's mission of leading the AI revolution.
  • Work in an empowering and inclusive environment to successfully implement groundbreaking AI solutions.

What we need to see:

  • 5+ working years' experience in software engineering, particularly in GPU programming and LLM inference.
  • Strong proficiency in programming languages such as Python, C++, and CUDA.
  • A solid understanding of deep learning frameworks and techniques.
  • Outstanding problem-solving skills and the ability to work collaboratively in a team setting.
  • Ambitious approach with a proven track record of taking initiative and delivering results.
  • A degree in Computer Science, Engineering, or a related field, or equivalent experience.

Widely considered to be one of the technology world's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer to you and your family.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

+ Show Original Job Post
























Compute Architecture Software Engineer
Shanghai
Engineering
About NVIDIA
A leading designer of graphics processing units (GPUs) for gaming and professional markets, as well as system on a chip units (SoCs) for the mobile computing and automotive market.