Job Description
We are now looking for a Software Development Engineer for LLM inference!
NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLMs, ChatGPT, and generative AI that have brought deep learning to the “iPhone moment” for AI. Join the team that is building the inference software that will be used across our product lines! The ability to work on a fast-paced, delivery-focused team is required, and excellent interpersonal skills are a must.
What you'll be doing:
+ Craft and develop robust inference software that can be scaled to multiple platforms for functionality and performance
+ Analyze, optimize, and tune performance for Large Language Models (LLMs)
+ Closely follow academic developments in the field of artificial intelligence and update TensorRT-LLM features accordingly
+ Provide feedback into the architec...