NVIDIA leads the generative AI revolution. We're seeking an experienced AI Software Engineer to optimize LLM inference performance. Our team works with compiler, kernel, hardware, and framework teams to identify bottlenecks, develop optimization techniques, and validate improvements. If you're passionate about system-level performance, compiler IR, and GPU kernel optimization for deep learning inference, we'd love to consider you for our team.
NVIDIA is recognized as one of the world's most desirable engineering workplaces, built by teams that value technical depth, innovation, and impact. We work alongside some of the best minds in GPU computing, systems software, and AI. If you're driven by performance, enjoy solving complex problems, and thrive in an environment that rewards initiative and technical excellence, we'd love to hear from you!