LLM Inference Systems Architect Real-Time Zero-to-One

adaption

📍 Toronto, ON, Canada

Full-time IT & Technology Posted February 02, 2026

Apply Now Similar Jobs

Job Description

                        A forward-thinking AI company in Toronto is seeking an experienced individual to design and build LLM inference systems from the ground up. In this role, you will explore advanced techniques for low-latency serving and collaborate closely with model developers. Ideal candidates will have strong experience optimizing inference frameworks, a deep performance mindset, and solid programming skills in Python. This role provides flexible working arrangements and opportunities for professional development.
   #J-18808-Ljbffr
                    

Apply for this Position

Ready to join adaption? Click the button below to submit your application.

Submit Application

Job Details

Location

Toronto, ON, Canada

Job Type

Full-time