Job Description

Job Description

Role Overview

We are looking for a Software Lead (8+ years’ experience) to own the runtime and neural network (NN) layer of a next-generation AI accelerator platform. This role focuses on designing, optimizing, and implementing NN operators and developing new ops using CUDA/custom runtime APIs to deliver high-performance execution on custom AI hardware.

Key Responsibilities

  • Design and optimize NN operators for performance-critical workloads
  • Develop new NN ops using CUDA/custom runtime APIs
  • Drive runtime-level optimizations across compute, memory, and scheduling
  • Own runtime ↔ NN layer interfaces and execution model
  • Implement and optimize operator fusion (e.g., matmul + bias + LayerNorm) for efficient hardware utilization<...

Apply for this Position

Ready to join Sandisk? Click the button below to submit your application.

Submit Application