Open role

[Remote] Senior AI Quality Engineer (LLM Evaluation & Automation) 1754

Remote · United Kingdom Full-time

Note: The job is a remote job and is open to candidates in USA. Softgic is a technology company seeking a Senior AI Quality Engineer to own the evaluation harness and quality gate for measurable agent quality. This role involves building and maintaining the eval harness, integrating evaluations into CI, and defining release-gate thresholds.

Responsibilities

Build and maintain the MVP eval harness: golden tasks, exception tasks, scorecard metrics, and regression packs
Wire evals into CI so quality regressions fail builds and releases
Define and maintain release-gate thresholds with Product and the Tech Lead
Lay the path for later adversarial and drift-testing expansion without overbuilding MVP scope

Skills

Experience evaluating ML, LLM, or non-deterministic systems
Strong test and benchmark design capability
Comfort working with noisy metrics, thresholds, and probabilistic behavior
Good scripting and automation skills

Company Overview

Impulsamos la transformación digital y cognitiva de las empresas mediante soluciones tecnológicas innovadoras y personalizadas que optimizan procesos, reducen costos y aceleran resultados. It was founded in 2011, and is headquartered in Sabaneta, Antioquia, COL, with a workforce of 51-200 employees. Its website is https://softwareestrategico.com.

Apply Now Open full posting

[Remote] Senior AI Quality Engineer (LLM Evaluation & Automation) 1754

More open positions

[Remote] Financial Planning Consultant

[Remote] Account Executive

[Remote] Data Governance Consultant(Retail Exp. Must)

[Remote] Senior Account Executive

[Remote] Lead Product Insights Analyst

Mortgage Loan Officer - Remote

[Remote] Accounting Finance Recruiter

[Remote] [Remote] SEO Account Manager (Legal/Prof Svcs exp req)

Pharmacovigilance Signal Detection Lead

Care Advocate Behavioral Health - San Diego Only--Remote – USA Remote Jobs

Experienced Junior Data Entry Specialist – Remote Work Opportunity at careerzynith

[Remote] Prevailing Wage & Apprenticeship Project Manager

Commercial Accounts Growth Lead

[Remote] Risk Adjustment Data Analyst

Principal GTM Recruiter (Remote, Contract)

Procurement Specialist (Remote)

LLM Fine-Tuning Engineer

Remote Online Notary (RON) / Mobile Notary

Healthcare Recruiter

Remote Customer Service Representative – Multi‑Channel Support, Order Fulfillment & E‑Commerce Operations

Senior Backend Engineer II, Marketplace