Information Systems Expert - AI Evaluator
• *About The Job
- *Mercor
connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include
- *Benchmark**
,
- *General Catalyst**
,
- *Peter Thiel**
,
- *Adam D'Angelo**
,
- *Larry Summers**
, and
- *Jack Dorsey**
.
- *Position:**
AI Model Evaluation Specialist
- *Type:
- *Contract
- Compensation:
- $40–$60/hour
- *Commitment:
- *20 hours/week
- *Role Responsibilities
- Write realistic prompts reflecting how professionals and consumers seek domain-specific guidance.
- Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness.
- Identify fabricated claims, incorrect references, or misleading reasoning across model outputs.
- Score and rank multiple model responses using structured rubrics across dimensions.
- Provide written justifications with specific evidence for each evaluation.
- *Qualifications
- *Must-Have
- Master’s degree or higher in Computer Science, Information Systems, or a relevant professional field.
- Professional experience applying domain expertise in a practitioner or advisory capacity.
- Familiarity with industry-specific standards, regulations, or clinical guidelines.
- Strong written communication and critical reasoning skills.
- *Application Process (Takes 20–30 mins to complete)
- Submit your resume to begin.
- Complete the Model Response Evaluation assessment.
- *Resources & Support**
• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome
- For any help or support, reach out to: [email protected]
- PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*
, Apply tot his job Apply To this Job