All roles

Open role

[Remote] Software Engineer

Remote · Poland Full-time

Note: The job is a remote job and is open to candidates in USA. Gold Group Ltd is a leading AI research institute seeking a Software Engineer to join their Benchmarking team. The role involves developing evaluations of AI models and collaborating with researchers to influence the AI community.

Responsibilities

  • Develop and run evaluations of the latest AI models
  • Build new benchmarks
  • Maintain evaluation infrastructure
  • Collaborate directly with researchers producing work that influences policymakers, industry leaders, and the wider AI community

Skills

  • Strong software engineering experience (language agnostic – Python preferred)
  • An interest in LLM evaluations, benchmarking, or AI capability testing
  • Curiosity about frontier AI and a research-oriented mindset
  • Someone who enjoys experimentation, solving difficult technical problems, and improving evaluation frameworks
  • Experience with evaluation frameworks such as Inspect

Benefits

  • Fully remote
  • Three international company retreats each year
  • Flexible working hours

Company Overview

  • Gold Group is celebrating 25 years in Recruitment! As one of the UK’s leading independently owned technical and professional recruitment consultancies. It was founded in 2000, and is headquartered in East Grinstead, West Sussex, GBR, with a workforce of 11-50 employees. Its website is https://www.goldgroup.co.uk/.
  • More open positions

    [Remote] 100% Remote - Sr. Clinical Advisor

    Work from home Full-time role

    [Remote] Lead OCM Consultant -Organizational Change Management

    Work from home Full-time role

    [Remote] Director, Product Management - Brokerage

    Work from home Full-time role

    [Remote] Director of Mechanical Engineering (Building Systems)

    Work from home Full-time role

    [Remote] Director, Clinical Strategy & Operations

    Work from home Full-time role

    Research Project Assistant (Environmental Health and Engineering)

    Work from home Full-time role

    [Remote] Computational CAD Engineer - OpenSCAD

    Work from home Full-time role

    Full-Time Sales Director (Evening & Weekend Shift) – Remote

    Work from home Full-time role

    Industry Principal, MedTech

    Work from home Full-time role

    Kundenberater (all genders) – Finanzen & Versicherung

    Work from home Full-time role

    Clinical Nurse Auditor, RN, CPC (Full-time, Remote)

    Work from home Full-time role

    Freelance Contract Editor(s)

    Work from home Full-time role

    QA Automation Tester - Remote US

    Work from home Full-time role

    In-Home Nurse Practitioner or Physician Assistant (Per Diem) - Grangeville, ID

    Work from home Full-time role

    Remote Data Entry Specialist – Entry‑Level Position with Flexible Hours at careerzynith

    Work from home Full-time role

    Remote Clinical Supervisor Therapists & Social Workers in Group Practice

    Work from home Full-time role

    Utilization Review Coordinator PRN

    Work from home Full-time role

    Account Executive 4, Higher Ed Specialist

    Work from home Full-time role

    Program Manager, Innovation

    Work from home Full-time role

    [Remote] Senior Marketing Decision Scientist II

    Work from home Full-time role

    YouTube Content Creator Intern (Social Media Content Creator)

    Work from home Full-time role