Job Description
AI Performance Engineer (Cloud AI Engineering) – Senior – Staff – Senior Staff role at Qualcomm.
Engineering Group, Machine Learning Engineering.
Job Summary
Qualcomm is utilizing its traditional strengths in digital wireless technologies to play a central role in the evolution of Cloud AI. We are investing in several supporting technologies including Deep Learning. The Qualcomm Cloud AI team is developing hardware and software solutions for Inference Acceleration.
Responsibilities
- Convert, optimize, and deploy models for efficient inference using PyTorch and ONNX.
- Work at the forefront of GenAI by understanding advanced algorithms (e.g., attention mechanisms and MoEs) and numerics to identify new optimization opportunities.
- Analyze and optimize the performance of LLM, VLM, and diffusion models for inference. Scale performance to meet throughput and latency constraints. ...