Job Description



NVIDIA is looking for a talented Performance Research and Analysis Engineer to join our Performance group. The ideal candidate will profile and analyze AI workloads on large GPUs and CPUs scale clusters for distributed Deep Learning LLM training and inference focusing at the communication patterns, collectives communication, RDMA, networking and system performance. You will work and interact with many types of HW platforms such as HCAs, Switches, CPUs, GPUs, Systems and also with various SW layers and features. You will experience with simulators and developing performance analysis tools and methodologies to dive deeply into the details, understand performance expectation, limitations, and bottlenecks as part of the root cause analysis of these jobs.







What you'll be doing:

+ Experience and research AI workloads and DL models specifically tailored for large-scale deep learning LLM training on NVIDIA supercomputers with a focus on ...

Apply for this Position

Ready to join Nvidia? Click the button below to submit your application.

Submit Application