Job Description

**Overview**

Research Internships at Microsoft provide a dynamic environment for research careers, with a network of world-class research labs led by globally recognized scientists and engineers who pursue innovation across a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.

As an Applied Science Research Intern, you will work with a small team to investigate recent Small Language Model (SLM) architectures and techniques, such as recurrent transformers and universal transformers, as potential approaches for maximizing the throughput of Large Language Models (LLMs) when high-speed cache is limited. For example, could a useful model be pinned to Very Tightly Coupled Memory (VTCM) in a Qualcomm System on Chip (SoC) for its entire lifecycle? Similarly, could this be achieved in the fast caches of Graphics Processing Units (GPUs) or cloud Neural Processing Units (NPUs...
