All roles

Open role

Senior AI Quality Engineer (LLM Evaluation & Automation) 1754

Remote · Switzerland Full-time

This is a remote position. Owns the eval harness and quality gate from the beginning. This role replaces the old late-stage “Evals Specialist” model with a standing owner for measurable agent quality.

Key Responsibilities

  • Build and maintain the MVP eval harness: golden tasks, exception tasks, scorecard metrics, and regression packs.
  • Wire evals into CI so quality regressions fail builds and releases.
  • Define and maintain release-gate thresholds with Product and the Tech Lead.
  • Lay the path for later adversarial and drift-testing expansion without overbuilding MVP scope.

Requisitos Must-Have Qualifications

  • Experience evaluating ML, LLM, or non-deterministic systems.
  • Strong test and benchmark design capability.
  • Comfort working with noisy metrics, thresholds, and probabilistic behavior.
  • Good scripting and automation skills.

AI-First Expectations

  • Uses AI to generate candidate eval cases and failure hypotheses, but never confuses generated tests with validated quality.
  • Approaches AI quality as an operating system, not a QA afterthought.

What Success Looks Like in the First 90 Days

  • The first reference agent has a published scorecard and gated eval path.
  • Golden and exception tests run automatically.
  • The team can explain what “good enough to ship” means in measurable terms.

More open positions

Supervisor Operations

Work from home Full-time role

Sr. Director, Customer Success

Work from home Full-time role

Senior Endpoint Engineer

Work from home Full-time role

National Account Manager

Work from home Full-time role

National Account Manager

Work from home Full-time role

Remote Part-Time Data Entry Specialist | Flexible Work-From-Home Position – Data Management & Accuracy Expert

Work from home Full-time role

Senior Governance, Risk, Compliance; GRC Analyst

Work from home Full-time role

Remote - Senior Power BI Developer

Work from home Full-time role

Embedded Software Engineer

Work from home Full-time role

Remote Data Entry Coordinator – Clearance Management for careerzynith Entertainment Content (No Experience Required)

Work from home Full-time role

Senior Cloud Engineer I

Work from home Full-time role

Massage Therapist Junior Recruiter

Work from home Full-time role

[Remote] Associate Manager, Measurement Products

Work from home Full-time role

[Remote] Senior Finance Manager

Work from home Full-time role

Sales Manager

Work from home Full-time role

Financial Clearance Representative L1 (PreReg)

Work from home Full-time role

Regional Contracts Manager - Data Centers

Work from home Full-time role

Senior Scrum Master- Remote

Work from home Full-time role

Business Intelligence Engineer (100% remote)

Work from home Full-time role

Travel Industry Associate

Work from home Full-time role

Experienced Chat Agent Senior Associate - Remote Opportunity at careerzynith

Work from home Full-time role