Job Description
About The Position
Company Overview:
About the Job
We are looking for an AI Quality & Evaluation Engineer to own the quality planning and execution of an AI-powered chat application operating over complex law enforcement and mobile device data.
This is a highly hands-on role focused on execution rather than high-level QA strategy. You will design, build, and run automated and semi-automated tests for LLM-driven workflows, create evaluation datasets, and continuously stress the system with realistic and extreme investigative scenarios.
What you will be doing
• Design, plan, and execute quality tests for an AI chat application built on LLMs and investigative data.
• Build and maintain automation frameworks for prompt regression testing, multi-turn conversations, and model upgrades.
• Create and curate evaluation datasets used for regression testing, benchmarking, and model comparison.
• Design c...
Apply for this Position
Ready to join cellebrite? Click the button below to submit your application.
Submit Application