Job Description



We are now looking for a TensorRT-LLM Software Development Engineer! NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas such as large language models (LLMs), ChatGPT, and generative AI that have brought deep learning to its “iPhone moment.” Join the team that is building the inference software foundational to product lines within NVIDIA and across the industry! The ability to work on a fast-paced, delivery-focused team is required, and excellent interpersonal skills are a must.







What you'll be doing:

+ Craft and develop robust inference software that can be scaled to multiple platforms for functionality and performance

+ Analyze, optimize, and tune performance for Large Language Models (LLMs)

+ Conduct unit tests and performance tests for different stages of the inference pipe...
