Job Description
Role Description:
The engineer will be responsible for the design and development of optimization tools for neural networks, transformers, and large language models (LLMs). This role involves applying post-training, training-aware, and other advanced optimization techniques to enhance model efficiency and performance.
Key responsibilities:
. Develop optimization toolchains for computer vision models and large language models (LLMs).
. Perform hardware-aware model optimization and porting for Ambarella platforms.
. Research and evaluate emerging technologies, including pruning, quantization, and fine-tuning techniques for convolutional neural networks (CNNs), transformers, and LLMs.
. Provide technical support and solutions to customers regarding model optimization and deployment.
Requirements:
. Education background: Master degree or degree
. Minimum experience: At least one year of relevant work or academic experience
...
Apply for this Position
Ready to join Ambarella? Click the button below to submit your application.
Submit Application