All roles

Open role

Member of Engineering – Pre-training, Data Engineering

Remote · United States Full-time

Job Description:

  • Build and maintain high-performance pipelines for trillions of tokens.
  • Deliver diverse and high quality datasets for pre-training foundation models.
  • Closely work with other teams such as Pretraining, Posttraining, Evals and Product to to ensure alignment on the quality of the models delivered.

Requirements:

  • Strong background in building production-grade, distributed data systems for machine learning, with experience in:
  • Orchestration: Slurm, Airflow, or Dagster
  • Observability & Reliability: CI/CD, Grafana, Prometheus, etc.
  • Infra: Git, Docker, k8s, cloud managed services
  • Batched inference (ex: vLLM)
  • Performance obsession, especially with large-scale GPU clusters and distributed pipelines
  • Expert-level python knowledge and ability to write clean and maintainable code
  • Strong algorithmic foundations
  • Proficiency with libraries like Polars, Dask, or PySpark
  • Nice to have:
  • Experience in building trillion-scale SOTA pretraining datasets
  • Experience translating research to production at scale
  • Experience with OCR, web crawling, or evals
  • Prior experience pre-training LLMs

Benefits:

  • Fully remote work & flexible hours
  • 37 days/year of vacation & holidays
  • Health insurance allowance for you and dependents
  • Company-provided equipment
  • Wellbeing, always-be-learning and home office allowances
  • Frequent team get togethers
  • Great diverse & inclusive people-first culture

More open positions

Senior Business Intelligence Manager

Work from home Full-time role

SQL Database Administrator - Advanced for Remote Work

Work from home Full-time role

Senior Database Administrator / Full Time / Remote

Work from home Full-time role

Remote - SAP Oracle DBA $80/hr Srinivasa Kandi

Work from home Full-time role

REMOTE - Junior Database Administrator I (Contingent)

Work from home Full-time role

Senior Manager Client Solutions (Manheim)

Work from home Full-time role

Online Data Analyst Junior for 17 Year Old Teens – Python focus

Work from home Full-time role

Utilization Review Nurse- Remote

Work from home Full-time role

Business Partner Specialist Advisor

Work from home Full-time role

Senior Java Developer

Work from home Full-time role

Remote Life Insurance Sales | No Experience Needed | Uncapped

Work from home Full-time role

Artificial Intelligence Co-Founder / CCO (100 % remote) (m/f/d)

Work from home Full-time role

[Remote] CGI OMS Application Support Analyst (Utility Outage Management Systems)

Work from home Full-time role

Experienced Remote Data Entry Clerk – Data Management and Operations Support

Work from home Full-time role

Associate Medical Sales Representative (Lowell, MA / Nashua, NH)

Work from home Full-time role

Broker Transaction, Analyst - TX - (TEMP) - (REMOTE)

Work from home Full-time role

3rd Party Experienced HC Collections - Remote

Work from home Full-time role

Remote Machine Learning Engineer Talent Network - AI Trainer ($70-$250 per hour)

Work from home Full-time role

Sr Mgr Clinical Data Management Study Lead

Work from home Full-time role

Technical Account Management Manager: Lead the Way in Customer Success at careerzynith

Work from home Full-time role

Senior Headhunter / Independent Recruiter Remote | Commission Only | U.S. Recruiting

Work from home Full-time role