Mercor | Engineering
Software Engineer - Evaluation Author
Remote | $35–$120/hour | Posted 9 days ago
Mercor connects elite creative and technical talent with leading AI research labs. The company is based in San Francisco and has notable investors including Benchmark, General Catalyst, Peter Thiel, Adam D'Angelo, Larry Summers, and Jack Dorsey.
Location: Remote
Salary: $35–$120/hour
Responsibilities
- Author non-trivial coding tasks with golden solutions and automated verifiers.
- Design rubrics and grade agent trajectories and model outputs.
- Improve task and rubric quality through structured review.
- Evaluate the accuracy and depth of AI-generated content to strengthen reasoning and rigor in model outputs.
- Work independently and asynchronously to meet deadlines while improving AI model performance.
Requirements
- 5+ years of software engineering at a real product organization (big tech or venture-backed startup).
- Strong code quality, systems design, debugging, and testing discipline.
- Clear written communication (you write instructions others follow).
Additional Information
- Familiarity with AI coding tools and evals is preferred.
- The interview process includes a short technical screen, a live code review, and a domain expert interview. Candidates receive a $200 payment for completing all three.
- The application involves uploading a resume, completing an AI interview, and submitting a form.