[Remote] Senior AI Quality Engineer (LLM Evaluation & Automation) 1754

Remote, USA Full-time Posted 2026-07-05

Note: This is a remote job open to candidates in the USA. reputed company is a technology company seeking a Senior AI Quality Engineer to own the evaluation reputed company and quality reputed company for measurable agent quality. The role involves building and maintaining the eval reputed company, integrating evaluations into CI, and defining release-reputed company reputed company.

Responsibilities

  • Build and maintain the MVP eval reputed company: golden tasks, exception tasks, scorecard metrics, and regression packs
  • Integrate evals into CI so quality regressions fail builds and releases
  • Define and maintain release-reputed company reputed company with Product and the Tech reputed company
  • Lay the path for reputed company adversarial and reputed company-testing expansion without overbuilding MVP scope

Skills

  • Experience evaluating ML, LLM, or non-deterministic systems
  • Strong test and reputed company design capability
  • Comfort working with noisy metrics, reputed company, and probabilistic behavior
  • Good scripting and automation skills

Company Overview

  • We drive the digital and cognitive transformation of companies through innovative, customized technology solutions that optimize processes, reduce costs, and accelerate results. The company was founded in 2011 and is headquartered in Sabaneta, Antioquia, COL, with a workforce of 51-200 employees. Its website is https://softwareestrategico.com.
