Job Description
We are now looking for a Senior Deep Learning Inference Performance Architect!
NVIDIA is seeking a Senior Performance Architect - a creative engineer who loves to squeeze every cycle of performance out of deep learning software. The Inference Architecture team does groundbreaking hardware-software co-design work focused on accelerating AI Inference workloads. In this role, you will write performance-optimized, low-level code on today’s GPUs, evaluate and improve state-of-the-art performance techniques in production Large Language Model deployments, and help guide our future GPU architecture decisions. If you enjoy digging deep into GPU architecture details, are passionate about AI, and know where every cycle goes when you write highly tuned software, this role may be a great fit for you.
What you’ll be doing:
+ Develop innovative GPU and system architectures to extend the state of the art in AI Inference performance and ef...