Job Description

You will collaborate closely with researchers to design and scale agents, enabling them to reason, plan, call tools, and write code much as human engineers do. You will build and maintain the core infrastructure for deploying and running these agents in production, powering all our agentic tools and applications and keeping them reliable and efficient. If you're passionate about the latest research and technologies shaping generative AI, this role and team offer an opportunity to be at the forefront of innovation.

What you'll be doing:
+ Design, develop, and improve scalable infrastructure to support the next generation of AI applications, including copilots and agentic tools.
+ Drive improvements in architecture, performance, and reliability, enabling teams to apply LLMs and advanced agent frameworks at scale.
+ Collaborate across hardware, software, and research teams, mentoring and su...
