Job Description
Thomson Reuters is seeking a Senior Software Engineer, AI (C#, Cloud). This role is for someone with specialized experience in machine learning and deep learning domains such as model compression, hardware-aware model optimization, hardware accelerator architecture, GPU/ASIC architecture, machine learning compilers, high-performance computing, performance optimization, numerics, or SW/HW co-design.
About the Role
This role involves optimizing LLMs and ML models for high-performance inference; deploying and scaling inference workloads on GPUs across AWS, Azure, GCP, and internal Kubernetes clusters; implementing routing and failover strategies; integrating models into production-grade APIs; and building containerized inference pipelines that comply with Thomson Reuters AI standards.
Responsibilities
- Optimize LLMs and ML models for high-performance inference using techniques such as quantization, pruning, distillation, and hardware spec...