Job Description

Overview

In this role, you will collaborate closely with one of our esteemed clients—a global leader in their industry, recognized for their commitment to quality, innovation, and excellence. They have partnered with Dautom as their trusted IT consulting provider for upcoming strategic initiatives.

Responsibilities and Requirements
  • Experience: 7–12 years in Cloud Infrastructure, DevOps, ML Infrastructure, or Platform Engineering.
  • Deep Hands-On Expertise in GPU Systems (NVIDIA A100/H100), Linux, Containers, and Kubernetes.
  • OpenShift AI (RHODS) or equivalent Kubernetes GPU orchestration.
  • LLM Hosting (Llama, Mistral, Falcon, etc.) and supporting Vector Databases/RAG systems.
  • Strong Experience In: TensorFlow, PyTorch, Hugging Face, Distributed Training (DDP, Deep Speed), and ML Ops Stacks (MLflow, Kubeflow).
  • Technical: Deep understanding of GPU compute, HPC architectures, and ML performance profiling. Strong ...

Apply for this Position

Ready to join Dautom? Click the button below to submit your application.

Submit Application