Job Description

About The Role

Join Cerebras as a Performance Engineer within our innovative Runtime Team. Our groundbreaking CS-3 system, hosted by a distributed set of modern and powerful x86 machines, has set new benchmarks in high-performance ML training and inference solutions. It leverages a dinner-plate sized chip with 44GB of on-chip memory to surpass traditional hardware capabilities. This role will challenge and expand your expertise in optimizing AI applications and managing computational workloads primarily on the x86 architecture that run our Runtime driver.

Responsibilities

  • Focus on CPU and memory subsystem optimizations for our Runtime software driver, enabling faster key cloud and ML training/inference workloads across modern x86 machines that form the backbone of our AI accelerator.
  • Develop and enhance algorithms for efficient data movement, local data processing, job submission, and synchronization between various software and hardware com...

Apply for this Position

Ready to join Cerebras Systems? Click the button below to submit your application.

Submit Application