Job Description

Description
AWS Utility Computing (UC) provides product innovations — from foundational services such as Amazon Elastic Compute Cloud (EC2), to new product innovations that continue to set AWS’s services and features apart in the industry.

Come develop inference acceleration for AWS Neuron, the complete software stack for Trainium, Amazon's custom cloud-scale machine learning accelerators that power the latest AI models

As the Sr. SDM for the Inference Technology Team, you will lead a strong team of managers and engineers to build fundamental inference technology building blocks and libraries to enable AI developers to optimize model for inference on Trainium and Inferentia devices. You will be responsible for the full development life cycle of inference library and feature development, including reliability and scalability. You will develop the Neuronx_Distributed Inference Libraries and contribute to other popular open source Inference Libraries, enabling custo...

Apply for this Position

Ready to join Amazon? Click the button below to submit your application.

Submit Application