Job Description

Overview

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: AI Model Evaluator

Type: Full-time or Part-time Contract Work

Compensation: $45–$80/hour

Location: Remote

Commitment: Flexible hours

Role Responsibilities

  • Evaluate LLM-generated responses to coding and software engineering queries for accuracy, reasoning, clarity, and completeness.
  • Conduct fact-checking using trusted public sources and authoritative references.
  • Execute code and validate outputs using appropriate tools to ensure accuracy.
  • Annotate model responses by identifying strengths, areas for improvement, and factual or conceptual inaccuracies.
  • Assess code quality, readability, algor...
