See all roles

Model Evaluator @ Austin, TX/Sunnyvale, CA- Hybrid -1+yr

Work from home Full-time role Hiring

REQUIREMENT Model Evaluator Project Duration: 1 year, with possible extension based on performance Location - Austin, TX/Sunnyvale, CA Work Type - Hybrid ( 3 days office must) Type of Visa - GC/Citizen - Independent Candidates only Technical Skills

  • Strong understanding of LLMs, generative AI, and transformer-based architectures.
  • Experience with Python, data analysis, and model evaluation frameworks.
  • Familiarity with prompt engineering, embeddings, RLHF/RLAIF, and LLM-based scoring methods.
  • Experience building evaluation datasets and working with annotation platforms.
  • Understanding of safety alignment, bias detection, and adversarial testing.
  • Tools & Platforms
  • ML/AI frameworks: PyTorch, TensorFlow, HuggingFace, LangChain.
  • Evaluation/annotation tools: Scale AI, GroundTruth, Labelbox, Prodigy.
  • Prompt testing tools: Weights & Biases, MLflow, OpenAI evals, LLM-as-a-judge pipelines.

Thanks & Regards, John Stanley- Sr. BDM / Delivery Manager Maintec Technologies Inc 8801 Fast Park Drive, Ste. 301, Raleigh, NC 27617 Mobile: +1 (919) 267-1887 / +91- 98411-45549 Email: [email protected]; www.maintec.in | www.maintec.com LinkedIn :www.linkedin.com/in/johnstanley1/ Bangalore | Chennai | Hyderabad | Pune | Noida | USA Apply tot his job Apply To this Job

You might like

IN - DLGF Senior Programmer (.NET)

Work from home Full-time role

Développeur Mainframe

Work from home Full-time role

Sr. SAS Programmer

Work from home Full-time role

Senior Systems Programmer (MQ)

Work from home Full-time role

.NET Programmer 3

Work from home Full-time role

Senior IBM z/OS Communications Programmer

Work from home Full-time role

Cobol/Mainframe Designer/Programmer

Work from home Full-time role

Mainframe Z/VSE Systems Technician

Work from home Full-time role

Mainframe MQ Support - MQ System Programmer

Work from home Full-time role

Senior Systems Programmer - Storage

Work from home Full-time role

Freelance IT Product Manager for a LIMS in Clinical Diagnostics

Work from home Full-time role

Experienced Data Entry Specialist – Hybrid Remote and On-Site Opportunity at arenaflex

Work from home Full-time role

Manager, Engineering

Work from home Full-time role

Experienced Data Entry Specialist – Remote Work Opportunity at arenaflex

Work from home Full-time role

Staff Accountant (CPA or EA)

Work from home Full-time role

Experienced Full Stack Customer Service Representative – Remote Support for arenaflex

Work from home Full-time role

Customer Team Leader (District Sales Manager), Cardiovasular Disease - Houston District

Work from home Full-time role

Senior Cyber & Technology Risk Consultant

Work from home Full-time role

Experienced Full Stack Data Entry Specialist – Amazon E-commerce Operations

Work from home Full-time role

Experienced Full Stack Customer Support Specialist – Cloud Security Solutions

Work from home Full-time role