Job Description
Job Summary
We are seeking a skilled AI Compiler Optimization Engineer to optimize AI model inference performance through advanced compiler technologies. You will focus on performance tuning for CPU or hybrid CPU/XPU heterogeneous architectures, profiling AI frameworks to discover new optimization opportunities, and delivering cutting‑edge insights from industry research.
Key Responsibilities
- Compiler-Based Performance Optimization
- Implement compiler techniques (e.g., MLIR level optimizations, LLVM backend optimizations) to enhance inference performance on CPU and CPU/XPU hybrid systems.
- Optimize JIT level compute graphs with operator fusion, memory allocation, etc. for latency/throughput improvements.
- Preferred: Experience with LLVM/MLIR development.
- AI Model Profiling & Framework Optimization
- Profile end-to-end inference workflows on frameworks like TensorFlow, PyTorc...
Apply for this Position
Ready to join Huawei Technologies Research & Development (UK) Ltd? Click the button below to submit your application.
Submit Application