Job Description

A leading AI evaluation firm is seeking an experienced software developer to create high-quality coding prompts and evaluate LLM outputs. This remote role is ideal for candidates with over 10 years of experience and strong Python skills. You will document model failures, perform evaluations between ...

Apply for this Position

Ready to join Braintrust? Click the button below to submit your application.

Submit Application