Job Description
About The Job
Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Position
Conversational AI Evaluator
Type
Full-time or Part-time Contract Work
Compensation
$45–$80/hour
Location
Remote
Commitment
20+ hours/week
Role Responsibilities
- Evaluate LLM-generated responses to coding and software engineering queries for accuracy, reasoning, clarity, and completeness.
- Conduct fact‑checking using trusted public sources and authoritative references.
- Execute code and validate outputs using appropriate tools to ensure accuracy.
- Annotate model responses by identifying strengths, areas of improvement, and factual or conceptual inaccuracies.
- Assess code quality, ...
Apply for this Position
Ready to join Mercor? Click the button below to submit your application.
Submit Application