Job Description
A leading AI evaluation firm is seeking an experienced software developer to write high-quality coding prompts and evaluate LLM outputs. This remote role is suited to candidates with more than 10 years of experience and strong Python skills. You will document model failures, run comparative evaluations of LLMs, and work hands-on with advanced AI technologies. Fluency in English and prior coding annotation experience are preferred. This is a contract engagement with long-term potential.