Job Description

About The Job

Mercor connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Position: AI Model Evaluator

Type: Full‑time or Part‑time Contract Work

Compensation: $60–$100/hour

Location: Remote

Role Responsibilities

  • Evaluate LLM-generated responses to coding and software engineering queries for accuracy, reasoning, clarity, and completeness.
  • Conduct fact‑checking using trusted public sources and authoritative references.
  • Execute code and validate its output with appropriate tools to test accuracy.
  • Annotate model responses by identifying strengths, areas for improvement, and factual or conceptual inaccuracies.
  • Assess code quality,...
