
Head of AI Research

Work from home Full-time role Hiring

Head of Research (Retained Search)

Location: Fully Remote (West Coast Time Zone Preferred)

Compensation: $160,000 to $200,000 base salary + bonus + equity

Employment Type: Full-time

Work Authorization: US Citizen, Green Card, or approved work authorization

About the Opportunity

Our client is an early-stage company operating at the intersection of enterprise data and frontier AI. They are building infrastructure that enables leading AI labs to train and evaluate models using high-fidelity, real-world operational data sourced directly from enterprises.

This is a research-first organization where technical credibility defines long-term success. The team is well-capitalized, moving quickly, and focused on building durable advantages through research rigor rather than scale alone.

This search is being conducted on a retained basis.

Why This Role Exists

The company’s long-term advantage depends on two core pillars: access to proprietary real-world data and the ability to convert that data into research-grade assets that AI labs trust.

While data access is already being established, research credibility is the defining factor in whether that data is adopted. Poor evaluation work erodes trust immediately. High-quality evaluation work creates long-term partnerships with frontier labs.

This role exists to ensure that everything delivered meets the standard of a true research partner, not a vendor. This is a founding hire that will define the technical reputation of the company.

The Role

We are seeking a Head of Research to build and own the research function end to end. This role begins as a senior individual contributor with full ownership across evaluation, data productization, and lab-facing research work, with a clear path to building and leading a research team.

You will serve as the technical front door of the company, working directly with frontier AI labs while defining the standards behind every dataset and evaluation produced.

What You Will Own

Evaluation and Data Product Pipeline

  • Own the end-to-end pipeline that converts raw enterprise data into evaluation suites, reinforcement learning environments, and model-ready datasets
  • Define quality standards across all stages including ground truth, task difficulty, and safety validation
  • Partner with Engineering on parsing, privacy, and data packaging

Benchmark Design Across Domains

  • Design and standardize benchmarks across verticals such as healthcare, code, energy, and enterprise workflows
  • Determine which domains are viable for high-signal evaluation and where investment should be prioritized
  • Establish the methodology that governs all benchmark development

Research Interface with AI Labs

  • Act as the primary technical counterpart to post-training teams at frontier AI labs
  • Lead technical discussions, evaluations, and ongoing research collaborations
  • Co-design engagements that evolve into long-term data partnerships

Methodology and Quality Control

  • Build evaluation frameworks that detect contamination, reward hacking, verifier ceilings, and other failure modes
  • Define standards for reinforcement learning data creation including reward design and validation
  • Maintain internal methodology documentation that guides both engineering and customer-facing work

Data to Model Translation

  • Design systems that convert multimodal, real-world data into training-ready formats
  • Determine when synthetic data is appropriate versus when additional real-world sourcing is required
  • Build systems that distinguish real model capability gaps from evaluation artifacts

Team Buildout

  • Start as a senior IC with ownership of the research function
  • Build and scale a team of research engineers and applied scientists over time
  • Set the quality bar and act as the calibration point for all research output

What Success Looks Like

  • Benchmarks are trusted and used by frontier AI researchers
  • Evaluation work consistently identifies real model capability gaps
  • Data products are integrated into training workflows
  • Strong, ongoing relationships with research teams at leading AI labs
  • A scalable research function with clear standards and methodology

Who You Are

Required

  • Hands-on experience in post-training, evaluation, reinforcement learning data, or applied alignment work
  • Track record of building or co

