
AI Decision & Response Analyst

Work from home Full-time role Hiring

Responsibilities

  • Evaluate AI model responses for personalization quality, including grounding, integration, and helpfulness.
  • Design and execute multi-turn prompts based on personal context to test AI capabilities.
  • Analyze responses for hallucinations, incorrect personalization, and poor inferences.
  • Perform side-by-side comparisons of model outputs to determine quality and effectiveness.
  • Write clear and structured rationales for response evaluations and rankings.
  • Extract and verify debug information to ensure proper use of data sources.
  • Maintain strict data hygiene and ensure accurate documentation of evaluations.
  • Collaborate with cross-functional teams to improve AI model performance.

Requirements

  • Strong proficiency in Polish with excellent reading and writing skills.
  • Experience in data annotation, AI evaluation, content moderation, or a related role.
  • Strong analytical thinking and ability to assess nuanced AI responses.
  • Ability to design creative, multi-turn prompts based on personal context.
  • Understanding of personalization concepts, including identifying incorrect or forced personalization.
  • High attention to detail in evaluating subtle differences in model outputs.
  • Excellent written communication and structured reasoning skills.
  • Ability to work independently in a remote environment.
  • Willingness to use a personal Google account for evaluation purposes.
  • Full-time availability with at least 4 hours of overlap with PST working hours.
  • Bachelor’s degree or equivalent experience in a relevant analytical field.

