Job Description

Job Summary

We are seeking a skilled AI Compiler Optimization Engineer to optimize AI model inference performance through advanced compiler technologies. You will focus on performance tuning for CPU or hybrid CPU/XPU heterogeneous architectures, profiling AI frameworks to discover new optimization opportunities, and delivering cutting‑edge insights from industry research.

Key Responsibilities

  1. Compiler-Based Performance Optimization
    • Implement compiler techniques (e.g., MLIR level optimizations, LLVM backend optimizations) to enhance inference performance on CPU and CPU/XPU hybrid systems.
    • Optimize JIT level compute graphs with operator fusion, memory allocation, etc. for latency/throughput improvements.
    • Preferred: Experience with LLVM/MLIR development.
  2. AI Model Profiling & Framework Optimization
    • Profile end-to-end inference workflows on frameworks like TensorFlow, PyTorc...

Apply for this Position

Ready to join Huawei Technologies Research & Development (UK) Ltd? Click the button below to submit your application.

Submit Application