Job Description

Our team is responsible for the AWS Neuron software stack, which powers Generative AI and other advanced ML workloads on AWS's custom-built ML accelerators, Inferentia and Trainium. These accelerators deliver best-in-class performance and cost-efficiency for ML inference and training in the cloud.
We're building a new core group of engineers in TLV (Tel Aviv) to drive innovation in ML systems performance and software. As a Machine Learning Performance Engineer, you'll help shape the direction of the team from the ground up and work on:

Optimizing system performance across the entire ML software stack
Analyzing high-performance ML workloads running on Annapurna hardware
Developing high-performance kernels for critical ML operations
Enhancing the Neuron SDK to improve developer experience and system capabilities
Collaborating across Compiler, Frameworks, and Hardware teams to maximize end-to-end performance

As part of the Performance Engin...
