Job Description
At TuriyamAI, we are pioneering world-leading GenAI semiconductor solutions from India, for India and the world. Our breakthrough solutions are set to redefine the future of AI computing, driving unparalleled efficiency, performance, and accessibility for enterprises worldwide.
Responsibilities:
- Implement quantization techniques to optimize model performance and efficiency.
- Optimize inference at node and cluster scale.
- Develop, modify, and optimize inference frameworks such as vLLM and SGLang.
- Perform layer-by-layer analysis and profiling of multi-modal LLMs, Stable Diffusion, speech, and other models.
- Collaborate with HW and SW teams to drive innovations and optimizations into product deployments.
- Stay up to date with the latest advancements in machine learning and AI technologies, and apply them to improve our products and services.
<...