All roles

Open role

[Remote] Senior AI Quality Engineer (LLM Evaluation & Automation) 1754

Remote · United Kingdom Full-time

Note: The job is a remote job and is open to candidates in USA. Softgic is a technology company seeking a Senior AI Quality Engineer to own the evaluation harness and quality gate for measurable agent quality. This role involves building and maintaining the eval harness, integrating evaluations into CI, and defining release-gate thresholds.

Responsibilities

  • Build and maintain the MVP eval harness: golden tasks, exception tasks, scorecard metrics, and regression packs
  • Wire evals into CI so quality regressions fail builds and releases
  • Define and maintain release-gate thresholds with Product and the Tech Lead
  • Lay the path for later adversarial and drift-testing expansion without overbuilding MVP scope

Skills

  • Experience evaluating ML, LLM, or non-deterministic systems
  • Strong test and benchmark design capability
  • Comfort working with noisy metrics, thresholds, and probabilistic behavior
  • Good scripting and automation skills

Company Overview

  • Impulsamos la transformación digital y cognitiva de las empresas mediante soluciones tecnológicas innovadoras y personalizadas que optimizan procesos, reducen costos y aceleran resultados. It was founded in 2011, and is headquartered in Sabaneta, Antioquia, COL, with a workforce of 51-200 employees. Its website is https://softwareestrategico.com.
  • More open positions

    [Remote] Financial Planning Consultant

    Work from home Full-time role

    [Remote] Account Executive

    Work from home Full-time role

    [Remote] Data Governance Consultant(Retail Exp. Must)

    Work from home Full-time role

    [Remote] Senior Account Executive

    Work from home Full-time role

    [Remote] Lead Product Insights Analyst

    Work from home Full-time role

    Mortgage Loan Officer - Remote

    Work from home Full-time role

    [Remote] Accounting Finance Recruiter

    Work from home Full-time role

    [Remote] [Remote] SEO Account Manager (Legal/Prof Svcs exp req)

    Work from home Full-time role

    Pharmacovigilance Signal Detection Lead

    Work from home Full-time role

    Care Advocate Behavioral Health - San Diego Only--Remote – USA Remote Jobs

    Work from home Full-time role

    Experienced Junior Data Entry Specialist – Remote Work Opportunity at careerzynith

    Work from home Full-time role

    [Remote] Prevailing Wage & Apprenticeship Project Manager

    Work from home Full-time role

    Commercial Accounts Growth Lead

    Work from home Full-time role

    [Remote] Risk Adjustment Data Analyst

    Work from home Full-time role

    Principal GTM Recruiter (Remote, Contract)

    Work from home Full-time role

    Procurement Specialist (Remote)

    Work from home Full-time role

    LLM Fine-Tuning Engineer

    Work from home Full-time role

    Remote Online Notary (RON) / Mobile Notary

    Work from home Full-time role

    Healthcare Recruiter

    Work from home Full-time role

    Remote Customer Service Representative – Multi‑Channel Support, Order Fulfillment & E‑Commerce Operations

    Work from home Full-time role

    Senior Backend Engineer II, Marketplace

    Work from home Full-time role