Job Description
Overview
Sr. ML Performance Engineer, AWS Neuron, Annapurna Labs — Amazon Web Services (AWS)
The Annapurna Labs team at AWS builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium.
Key responsibilities
- Analyze and optimize system-level performance of machine learning models across the entire technology stack, from frameworks to runtime
- Conduct detailed performance analysis and profiling of ML workloads, identifying and resolving bottlenecks in large-scale ML systems
- Work directly with customers to enable and optimize their ML models on AWS accelerators, understanding their specific requirements and use cases
- Design and implement compiler optimizations, transforming manual performance improvements into automated compiler passes
- Collaborate across teams to develop innovative optimization techn...
Apply for this Position
Ready to join Amazon Web Services (AWS)? Click the button below to submit your application.
Submit Application