Job Description
Overview
At NVIDIA, we’re tapping into the unlimited potential of AI to define the next era of computing. The GPU acts as the brains of computers, robots, and self-driving cars that can understand the world.
What you'll be doing
- Work closely with internal engineering and product teams and external app developers to solve local end-to-end AI GPU deployment challenges on the NVIDIA RTX AI platform.
- Use powerful profiling and debugging tools to analyze demanding GPU-accelerated end-to-end AI applications and detect insufficient GPU utilization that results in suboptimal runtime performance.
- Conduct hands‑on training, develop sample code and host presentations to guide efficient end-to-end AI deployment targeting optimal runtime performance on NVIDIA ARM‑based SoCs.
- Improve the Windows LLM & GenAI user experience on NVIDIA RTX by working on feature and performance enhancements to open-source software, including projects like GGML, Llama...