Job Description
About The Job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark , General Catalyst , Peter Thiel , Adam D'Angelo , Larry Summers , and Jack Dorsey .
Position: AI Model Evaluator
Type: Full‑time or Part‑time Contract Work
Compensation: $60–$100/hour
Location: Remote
Role Responsibilities
- Evaluate LLM-generated responses to coding and software engineering queries for accuracy, reasoning, clarity, and completeness.
- Conduct fact‑checking using trusted public sources and authoritative references.
- Execute code and validate outputs using appropriate tools to conduct accuracy testing.
- Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies.
- Assess code quality,...
Apply for this Position
Ready to join Mercor? Click the button below to submit your application.
Submit Application