Job Description
**Summary:**
We are looking for software engineers to help scale and improve the efficiency of large AI training and inference for HW accelerators. A core part of this is optimising collective operations so that data sharing makes efficient use of the network. This is an opportunity to work within a highly skilled team, collaborate with a large set of cross-functional partners, and help bring next-generation large-cluster architectures to life.
**Required Skills:**
Software Engineer - AI/HPC Specialist Responsibilities:
1. Work on collective communications stacks to optimise networking operations, improving AI model inference and training performance
2. Drive implementation of latency- and bandwidth-critical networking operations, as well as out-of-band signalling
3. Debug custom and third-party multi-host, accelerator-enabled AI platforms
4. Software development using C++/C and Python
5....
Apply for this Position
Ready to join Meta? Click the button below to submit your application.