Job Description

Description

Our team is responsible for the AWS Neuron software stack, which powers Generative AI and other advanced ML workloads on AWS's custom-built ML accelerators — Inferentia and Trainium. These accelerators deliver best-in-class performance and cost-efficiency for ML inference and training in the cloud.

We're building a new core group of engineers in TLV (Tel Aviv) to drive innovation in ML systems performance and software. As a Machine Learning Performance Engineer, you'll help shape the direction of the team from the ground up and work on:



Optimizing system performance across the entire ML software stack

Analyzing high-performance ML workloads running on Annapurna hardware

Developing high-performance kernels for critical ML operations

Enhancing the Neuron SDK to improve developer experience and system capabilities

Collaborating across Compiler, Frameworks, and Hardware teams to maximize end-to-end performan...

Apply for this Position

Ready to join Amazon? Click the button below to submit your application.

Submit Application