Back to Jobs

Information Systems Expert - AI Evaluator

Remote, USA Full-time Posted 2026-06-06

• *About The Job

  • *Mercor

connects elite creative and technical talent with leading AI research labs. Headquartered in San Francisco, our investors include

  • *Benchmark**

,

  • *General Catalyst**

,

  • *Peter Thiel**

,

  • *Adam D'Angelo**

,

  • *Larry Summers**

, and

  • *Jack Dorsey**

.

  • *Position:**

AI Model Evaluation Specialist

  • *Type:
  • *Contract
  • Compensation:
  • $40–$60/hour
  • *Commitment:
  • *20 hours/week
  • *Role Responsibilities
  • Write realistic prompts reflecting how professionals and consumers seek domain-specific guidance.
  • Evaluate AI-generated responses for factual accuracy, regulatory or clinical correctness, and practical usefulness.
  • Identify fabricated claims, incorrect references, or misleading reasoning across model outputs.
  • Score and rank multiple model responses using structured rubrics across dimensions.
  • Provide written justifications with specific evidence for each evaluation.
  • *Qualifications
  • *Must-Have
  • Master’s degree or higher in Computer Science, Information Systems, or a relevant professional field.
  • Professional experience applying domain expertise in a practitioner or advisory capacity.
  • Familiarity with industry-specific standards, regulations, or clinical guidelines.
  • Strong written communication and critical reasoning skills.
  • *Application Process (Takes 20–30 mins to complete)
  • Submit your resume to begin.
  • Complete the Model Response Evaluation assessment.
  • *Resources & Support**

• For details about the interview process and platform information, please check: https://talent.docs.mercor.com/welcome/welcome

  • For any help or support, reach out to: [email protected]
  • PS: Our team reviews applications daily. Please complete your AI interview and application steps to be considered for this opportunity.*

, Apply tot his job Apply To this Job

Similar Jobs

BDI Evaluator

Remote, USA Full-time

AI Writing Evaluators (Domain Experts) - English Expertise

Remote, USA Full-time

Part-time AI Writing Evaluator (Tier 1)

Remote, USA Full-time

Business Research Evaluator | $30/hr Remote

Remote, USA Full-time

Social Media Evaluator (Ukrainian-United States)

Remote, USA Full-time

Qualified Medical Evaluator (QME) - Pain Medicine Physician - Part Time

Remote, USA Full-time

Regional Vocational Evaluation Specialist

Remote, USA Full-time

Lead Program Evaluator – Title III / Federal Education Grants

Remote, USA Full-time

Spanish Speaking CFTSS OLP Supervisor/Evaluator (Remote)

Remote, USA Full-time

Manufacturing Expert - Quality Evaluator

Remote, USA Full-time

Experienced Medical Customer Service Representative – Remote Opportunity with arenaflex

Remote, USA Full-time

Flexible Part-Time Evening Remote Data Entry Specialist | Work From Home Opportunity | Training Provided

Remote, USA Full-time

Consultant I, Medical Writing

Remote, USA Full-time

Experienced Part-Time Data Entry Typist – Agriculture, Fishing, and Forestry Industry

Remote, USA Full-time

Growth Marketing Manager (SEM & Paid Social) - full time & full remote (AZ/ CA/ OR/ TX residents)

Remote, USA Full-time

Remote Solar Sales Closer

Remote, USA Full-time

Experienced Customer Service Representative – Remote Healthcare Appointment Scheduling

Remote, USA Full-time

APTPUO - Fall 2026 - ENG1100 AI00

Remote, USA Full-time

Experienced Data Entry Associate – Remote Opportunity for Career Advancement and Growth

Remote, USA Full-time

Site Reliability Engineer (SRE) - Remote

Remote, USA Full-time