Generalist Evaluator Expert

Work from home Full-time role Hiring

Mercor is seeking detail-oriented writing experts to contribute to a high-impact AI research project with a leading lab. Freelancers will author prompt–golden answer pairs that train and evaluate advanced language models. This is a short-term, flexible opportunity for professionals with strong academic backgrounds and a knack for instructional clarity. It is ideal for those who enjoy distilling complex concepts into well-crafted text.

* * *

### Job Details:

- Design and Optimize Prompts: Create detailed prompts with multiple constraints and instructions.
- Define and Document Evaluation Standards: Establish high-level expectations for correct responses in general consumer contexts, and develop comprehensive rubrics.
- Conduct Model Testing and Grading: Run prompts through models and assess preliminary outputs against expectations.
- Support Benchmarking and Quality Assurance: Collaborate in QA review processes to ensure prompt tasks and rubrics meet the required level of rigor, maintaining consistency and reliability before integration into official benchmarks.

### Minimum Qualifications:

- BS or BA from a reputable institution, completed or in progress.
- Strong writing and critical thinking skills.
- Ability to work independently and meet deadlines.
- Significant familiarity with ChatGPT or similar tools for personal decision-making, hobbies, or general interests.
- Based in the US or Canada.

### Preferred Qualifications:

- Experience in teaching or research.

### Application & Onboarding Process:

- Complete an AI-led interview, which should take around 15 minutes.
- Complete a 45-minute written assessment that will guide you through writing rubrics.
- If selected, you will be invited to work on the project.

### More Details About This Role:

- This is a remote and asynchronous role, so you work on your own schedule.
- Expect to contribute at least 20 hours per week.
- Expect a commitment of around 1 month.
- You’ll be working in a structured project environment with clear goals and tools.

* * *

### About [Mercor](https://mercor.com/):

- Our team is based in San Francisco, CA.
- We [specialize](https://www.forbes.com/sites/johnwerner/2024/03/20/this-ai-startup-wants-to-create-jobs-not-take-them-away/) in recruiting experts for top AI labs.
- Our investors include Benchmark, General Catalyst, Adam D’Angelo, Larry Summers, and Jack Dorsey.
