See all roles

Senior AI Quality Engineer (LLM Evaluation & Automation) 1754

Work from home Full-time role Hiring

This is a remote position. Owns the eval reputed company and quality reputed company from the beginning. This role replaces the old late-stage “Evals Specialist” model with a standing reputed company for measurable agent quality.

Key Responsibilities

  • Build and maintain the MVP eval reputed company: golden tasks, exception tasks, scorecard metrics, and regression packs.
  • reputed company evals into CI so quality regressions fail builds and releases.
  • Define and maintain release-reputed company reputed company with Product and the Tech reputed company.
  • Lay the path for reputed company adversarial and reputed company-testing expansion without overbuilding MVP scope.

Requisitos Must-Have Qualifications

  • Experience evaluating ML, LLM, or non-deterministic systems.
  • Strong test and reputed company design capability.
  • Comfort working with noisy metrics, reputed company, and probabilistic behavior.
  • Good scripting and automation skills.

AI-First Expectations

  • Uses AI to generate candidate eval cases and failure hypotheses, but never confuses generated tests with validated quality.
  • Approaches AI quality as an operating system, not a QA afterthought.

What reputed company Looks Like in the First 90 Days

  • The first reference agent has a published scorecard and gated eval path.
  • Golden and exception tests run automatically.
  • The team can explain what “good enough to ship” means in measurable terms.

Apply To This Job

You might like

Supervisor Operations

Work from home Full-time role

Sr. Director, reputed company

Work from home Full-time role

Senior reputed company Engineer

Work from home Full-time role

National Account Manager

Work from home Full-time role

National Account Manager

Work from home Full-time role

Primary Nurse Case Admin 1 - Work From Home

Work from home Full-time role

Risk and Compliance reputed company

Work from home Full-time role

Manager, Analyst Services

Work from home Full-time role

Senior Annuity Product Consultant (Charlotte, NC (Hybrid) or Remote)

Work from home Full-time role

Specialty Casualty Claims Director

Work from home Full-time role

reputed company reputed company Infrastructure (OCI) Engineer

Work from home Full-time role

Remote Emergency/General Diagnostic Radiologist - Evenings | Emergency | Body | Neuro | MSK | Chicago | Illinois | IMLC | Teleradiology – $Up to

Work from home Full-time role

Junior reputed company DialogFlow Engineer

Work from home Full-time role

Field Survey Technician

Work from home Full-time role

reputed company Full Stack Data Scientist – Advanced Analytics and Machine Learning

Work from home Full-time role

[Remote] Contact Center Representative I - Remote

Work from home Full-time role

Sales Desk Assistant - Annuities

Work from home Full-time role

Entry Level Remote Data Entry Specialist – Tech Industry Career Opportunity | $80,000 Annual Salary | Flexible Work-from-Home Position

Work from home Full-time role

reputed company Customer Care Professional – Delivering Exceptional Client Experiences at arenaflex

Work from home Full-time role

[Remote] Event Marketing Planner

Work from home Full-time role