Job Description
Overview
We are seeking an experienced High Performance Computing (HPC) Engineer to design, maintain, and optimise large-scale computing environments that support data-intensive and compute-heavy workloads. You will work closely with researchers, developers, and infrastructure teams to ensure high availability, performance, and scalability of HPC systems.
Responsibilities
- Design, deploy, and manage HPC clusters (on-prem, cloud, or hybrid)
- Install, configure, and optimise job schedulers (e.g. Slurm, PBS, LSF)
- Tune system performance for CPU, GPU, memory, storage, and network workloads
- Support users with application optimisation and parallelisation
- Automate system administration using scripting and configuration management tools
- Monitor system health, capacity, and performance
- Troubleshoot hardware, software, and performance issues
- Collaborate on future architecture planning and upgrades
Apply for this Position
Ready to join IOVENDO? Click the button below to submit your application.
Submit Application