Job Description
**What you’ll achieve**
As a Research Intern at Dell Technologies Office of CTO (OCTO), you will be part of a world class research team delving deep into software and hardware aspects of next generation Generative AI techniques, working on developing highly performance systems that push the state-of-art. For this specific project you will use your deep expertise in machine learning based analytics to develop characterization and performance forecasting techniques for high end graphical processing units (GPUs) when used for inference serving at high levels of concurrency. You will be responsible for developing techniques that use of AI/ML approaches to estimate GPU performance for inferencing tasks based on use of state of art (SOTA) Gen AI models. Specifically, you will be working with the AI Techniques Research team on the Future of AI Inferencing project.
**You will:**
+ Design, evaluate and optimize advanced inference serving ...
As a Research Intern at Dell Technologies Office of CTO (OCTO), you will be part of a world class research team delving deep into software and hardware aspects of next generation Generative AI techniques, working on developing highly performance systems that push the state-of-art. For this specific project you will use your deep expertise in machine learning based analytics to develop characterization and performance forecasting techniques for high end graphical processing units (GPUs) when used for inference serving at high levels of concurrency. You will be responsible for developing techniques that use of AI/ML approaches to estimate GPU performance for inferencing tasks based on use of state of art (SOTA) Gen AI models. Specifically, you will be working with the AI Techniques Research team on the Future of AI Inferencing project.
**You will:**
+ Design, evaluate and optimize advanced inference serving ...
Apply for this Position
Ready to join Dell Technologies? Click the button below to submit your application.
Submit Application