All roles

Open role

[Remote] Senior Staff Machine Learning Engineer, Data & Eval

Remote · Indonesia Full-time

Note: The job is a remote job and is open to candidates in USA. Airbnb is a leading hospitality company that connects hosts and guests for unique stays and experiences. They are seeking a Senior Staff Machine Learning Engineer to set technical direction and lead execution for ML evaluation and data systems that power customer support AI products.

Responsibilities

  • Define evaluation strategy and success metrics for GenAI systems, aligning offline evaluation with online business and customer experience outcomes
  • Build and scale evaluation frameworks (golden sets, synthetic data, automated regressions, rubric-based grading, LLM-as-judge where appropriate) with strong controls for bias, drift, and reliability
  • Design the data flywheel: instrumentation, feedback collection, data quality checks, labeling strategy, dataset versioning, and governance to support continuous improvement
  • Lead cross-functional quality initiatives across product, ops, and engineering, driving clarity on what “good” looks like and how teams act on evaluation results
  • Develop and productionize pipelines for dataset creation, model monitoring, evaluation-at-scale, and continuous testing (pre-deploy and post-deploy)
  • Drive technical decisions and architecture for evaluation and data infrastructure, balancing speed, rigor, cost, and safety

Skills

  • Educational Background: PhD in Computer Science, Mathematics, Statistics, or related technical field (or equivalent practical experience)
  • Industry Experience: 10+ years building, testing, and shipping ML/AI systems end-to-end; including 2+ years of experience with GenAI/LLM systems in production
  • Leadership Experience: 5+ years leading large, ambiguous technical initiatives as a senior IC, influencing roadmap and engineering/science direction across teams
  • Technical Proficiency: Deep expertise in evaluation methodology (offline/online alignment, metric design, human-in-the-loop evaluation, A/B testing, power analysis, regression testing)
  • Hands-on experience with GenAI systems, including orchestration, retrieval, tool calling, memory, etc
  • Experience building data pipelines and quality systems (labeling workflows, dataset curation, versioning, monitoring, and governance)
  • Solid ML fundamentals and best practices (model selection, training/serving, monitoring, reliability, and model lifecycle management)
  • Customer Support Systems: Experience applying ML/AI to customer support workflows (e.g., agent assist, classification/routing, resolution recommendation, QA)
  • Infrastructure & Quality at Scale: Experience building robust evaluation platforms for agent behavior validation, safety/guardrails, and continuous improvement
  • Agile Practice for Applied AI: Proven ability to take evaluation and data flywheel work from incubation to production, iterating quickly while maintaining scientific rigor

Benefits

  • Bonus
  • Equity
  • Benefits
  • Employee Travel Credits

Company Overview

  • Airbnb is an online community marketplace for people to list, discover, and book accommodations through mobile phones or the Internet. It was founded in 2008, and is headquartered in San Francisco, California, USA, with a workforce of 5001-10000 employees. Its website is https://www.airbnb.com.
  • Company H1B Sponsorship

  • Airbnb has a track record of offering H1B sponsorships, with 59 in 2026, 234 in 2025, 176 in 2024, 160 in 2023, 270 in 2022, 250 in 2021, 274 in 2020. Please note that this does not guarantee sponsorship for this specific role.
  • More open positions

    [Remote] Account Manager

    Work from home Full-time role

    [Remote] Financial Analyst

    Work from home Full-time role

    [Remote] REMOTE - Information Security Engineer III - R12693

    Work from home Full-time role

    [Remote] Growth Media Sr. Strategist

    Work from home Full-time role

    [Remote] Network Engineer - Patient Monitoring (Field: Philadelphia/Allentown/Scranton, PA or Mercerville, NJ)

    Work from home Full-time role

    Experienced Remote Data Entry/Mail Room Clerk – Administrative Support Specialist

    Work from home Full-time role

    Experienced Virtual Customer Care Professional – Remote Opportunity with careerzynith

    Work from home Full-time role

    Senior Project Manager (Digital Marketing) [Remote]

    Work from home Full-time role

    Loss Prevention Investigator

    Work from home Full-time role

    Online Airport Customer Service Representative – Digital Passenger Support & Travel Experience Specialist at careerzynith

    Work from home Full-time role

    DevSecOps Lead

    Work from home Full-time role

    Senior Manager, Fund Administration

    Work from home Full-time role

    Entry-Level Remote Live Chat Customer Support Specialist – E-Commerce Assistance (No Prior Experience Required)

    Work from home Full-time role

    Phone & Chat Credential Specialist – Remote Healthcare Staffing Support with Bonus Opportunities

    Work from home Full-time role

    Remote P&C Licensed Customer Service Representative – Insurance Policy Support, Upsell & Retention Specialist

    Work from home Full-time role

    Senior Marketing Manager - Home and Personal Care

    Work from home Full-time role

    [Remote] Commercial Account Executive

    Work from home Full-time role

    [Remote] Manager, Software Engineering (Resilience Engineering)

    Work from home Full-time role

    Contingent Worker

    Work from home Full-time role

    Immediate Remote Data Entry & Form Filling Specialist – Flexible Hours, Accuracy‑Focused Role at careerzynith

    Work from home Full-time role

    [Remote] Lead GCP Engineer/Lead Architect

    Work from home Full-time role