Job Description

We're looking for a Vision AI Engineer to help build next‑generation video intelligence systems powered by modern vision‑language models. You'll work across the full video understanding stack—combining multimodal foundation models with established analytics approaches to deliver reliable, production‑ready AI solutions.

Key Responsibilities

  • Build end‑to‑end video analytics pipelines using vision‑language models.
  • Fine‑tune and adapt foundation models for domain‑specific video understanding.
  • Integrate VLM reasoning with traditional video analytics components.
  • Develop and maintain inference pipelines for video and multimodal data.
  • Deploy and optimize models for scalable, high‑performance production use.
  • Diagnose model issues and strengthen system stability and robustness.
  • Collaborate with product and engineering teams to deliver AI-driven features.

Required Qualifications

  • Strong background...

Apply for this Position

Ready to join ST Engineering? Click the button below to submit your application.

Submit Application