Job Description
Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.
Please submit your CV in English and indicate your level of English proficiency.
What this opportunity involves
- Create structured test cases that simulate complex human workflows
- Define gold-standard behavior and scoring logic to evaluate agent actions
- Analyze agent logs, failure modes, and decision paths
- Work with code repositories and test frameworks to validate your scenarios
- Iterate on prompts, instructions, and test cases to improve clarity and difficulty
- Ensure that scenarios are production-ready, easy to run, and reusable
What we look for
- 3+ years of software development experience with strong Python focus
- Experience with Git and code repositori...
Apply for this Position
Ready to join Real Trends? Click the button below to submit your application.
Submit Application