Research Physics Expert at Mercor
- Contract
- Remote, Remote
Role Overview
We are seeking expert physics researchers to author and verify golden reference solutions for the CritPt benchmark (arXiv:2509.26574v3) — a frontier research-level physics benchmark. Participants will solve CritPt research-level problems end-to-end, audit solutions from other experts, or adjudicate between parallel solution attempts, producing 100%-human-verified reference data used to evaluate large language models on frontier physics reasoning.
Physics Subdomains Covered
High Energy Physics & Mathematical Physics, Biophysics & Statistical Physics, Condensed Matter & AMO, Gravitation / Cosmology / Astrophysics, Quantum Information, Optical Properties of Materials, Magnetic Materials, Measurements in QM.
Key Responsibilities
Solve research-level physics challenges end-to-end with verifiable derivations, code, and peer-reviewed references
Decompose challenges into standalone checkpoint sub-problems that require genuine physical reasoning
Author Python answer templates with auto-grading functions for symbolic or numerical answers
Audit submitted solutions for correctness, scope, and method soundness; deliver actionable feedback across iterations
Adjudicate between parallel solver attempts and decide which solution becomes the golden reference
Document chain-of-thought reasoning, error tolerances, equivalent symbolic forms, and verification test cases
Ideal Qualifications
Solver: PhD or postdoc in the relevant subfield (senior PhD student minimum)
Auditor: Postdoc or junior professor in the relevant subfield (PhD minimum)
Adjudicator: Full professor or industry research PI in the relevant subfield (senior postdoc or junior professor minimum)
Hands-on familiarity with at least two canonical methods of the target subfield, demonstrable through publications (broader coverage strongly preferred)
3–5 representative publications (arXiv ID or DOI), ideally within the last ~5 years and in the target subfield
Working proficiency with LaTeX, Python, Jupyter, and SymPy
Strong written English (B2/C1/C2 minimum; native or near-native preferred)
More About the Opportunity
Expected commitment: ~10 hours/week, sustained across an 8–10 week window per task pool
Pay range: $80–$140 per hour, based on role and demonstrated expertise
Asynchronous work
We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Published 14 days ago • Expires June 20, 2026 20:56