workfromanywhereworkfromanywhere
All jobs
mercorEngineering

Software Engineer - Evaluation Author

Remote$35–$120/hourPosted 9 days ago

Mercor connects elite creative and technical talent with leading AI research labs. The company is based in San Francisco and has notable investors including Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.

Location: Remote

Salary: $35–$120/hour

Responsibilities

  • Author non-trivial coding tasks with golden solutions and automated verifiers.
  • Design rubrics and grade agent trajectories and model outputs.
  • Improve task and rubric quality through structured review.
  • Evaluate the accuracy and depth of AI-generated content to strengthen reasoning and rigor in model outputs.
  • Work independently and asynchronously to meet deadlines while improving AI model performance.

Requirements

  • 5+ years of software engineering at a real product organization (big tech or venture-backed startup).
  • Strong code quality, systems design, debugging, and testing discipline.
  • Clear written communication (you write instructions others follow).

Additional Information

  • Familiarity with AI coding tools and evals is preferred.
  • The interview process includes a short technical screen, live code review, and domain expert interview, with a $200 payment for completing all three.
  • Application involves uploading a resume, completing an AI interview, and submitting a form.

Location

Remote

Salary

$35–$120/hour

Category

Engineering

Company

mercor

Source

himalayas

Posted

9 days ago

Share this job

XLinkedIn

Similar remote jobs

DiversifiedNewEngineering

Senior Design Engineer - Electronic Security

$122,600 – $165,900
today
CanonicalNewEngineering

Security Software Engineer

Worldwide
today
Crawford & CompanyNewEngineering

Technical Engineer I

Remote – Anywhere in the U.S.
today