All roles

Open role

[Remote] AI Research Engineer (Multi-Modal Reinforcement Learning) - 100% Remote Worldwide

Remote · Peru Full-time

Note: The job is a remote job and is open to candidates in USA. Tether.io is pioneering a global financial revolution through their innovative digital finance solutions. They are seeking an AI Research Engineer to drive innovation in multi-modal reinforcement learning, focusing on optimizing decision-making and adaptive behavior across various data modalities to enhance AI performance in real-world challenges.

Responsibilities

  • Conduct research on reinforcement learning algorithms for multimodal models, including diffusion-based approaches for image autoregressive models for multimodal understanding, and unified frameworks that integrate multiple modalities
  • Design and build reinforcement learning infrastructure that supports scalable, distributed training across multimodal systems while maintaining efficiency and reliability
  • Develop and refine reward modeling strategies that improve training stability, align model behavior with desired outcomes, and mitigate reward hacking and related failure modes
  • Create and curate multimodal simulation environments and datasets to support robust training, evaluation, and benchmarking of reinforcement learning systems
  • Design and conduct rigorous benchmarking and evaluation protocols to measure model performance, track progress against baselines, and validate improvements across multimodal tasks
  • Analyze and optimize policy performance across modalities by identifying bottlenecks in training, credit assignment, and cross-modal alignment
  • Investigate and develop next-generation reinforcement learning paradigms that more effectively learn from environment feedback, with the goal of achieving superior state-of-the-art (SOTA) performance
  • Publish research findings in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV etc

Skills

  • A Master's degree in Computer Science or a related field is required
  • Proven experience running large-scale reinforcement learning experiments in multimodal and vision-centric systems, including online RL settings, with demonstrated impact on domain-specific decision-making and measurable improvements in policy performance
  • Deep understanding of reinforcement learning algorithms and optimization methods applied to vision and multimodal learning problems, with a focus on improving policy stability, exploration, and sample efficiency in complex, high-dimensional environments involving images, video, and other modalities
  • Strong proficiency in PyTorch and deep learning frameworks for vision and multimodal AI, with hands-on experience building end-to-end RL pipelines covering simulation, training, evaluation, and deployment in production-grade systems
  • Demonstrated ability to apply empirical research to solve core RL challenges in multimodal and vision tasks, such as sample inefficiency, exploration-exploitation tradeoffs, and training instability, along with experience designing robust evaluation frameworks and iterating on algorithmic improvements to advance agent performance
  • Proven track record of research publications in top-tier conferences such as ICML, NeurIPS, ICLR, CVPR, ICCV, ECCV etc
  • A PhD in Machine Learning, NLP, Computer Vision, or a closely related discipline is preferred, along with a strong track record of AI research and publications in top-tier conferences

Company Overview

  • Tether has evolved to meet global needs with agility and vision. It was founded in 2014, and is headquartered in Seattle, Washington, USA, with a workforce of 201-500 employees. Its website is https://tether.io.
  • More open positions

    [Remote] Mid-Level Process Engineer (Hybrid/Remote)

    Work from home Full-time role

    [Remote] Head of Marketing

    Work from home Full-time role

    [Remote] Senior Product Manager, Provider

    Work from home Full-time role

    [Remote] Senior Product Manager, Trust & Safety, Integrity and Fraud

    Work from home Full-time role

    [Remote] VP, Finance Excellence and Transformation

    Work from home Full-time role

    [Hiring] Care Manager (LPN/LVN) @PharmD Live

    Work from home Full-time role

    Experienced Data Entry Clerk – Part-time / Full-time Opportunity at careerzynith

    Work from home Full-time role

    Technical Operations Coordinator (Diagnostics Startup)

    Work from home Full-time role

    Werkstudent:in Content Management (w/m/d)

    Work from home Full-time role

    [Remote] Head of Intra Data Center Platforms

    Work from home Full-time role

    Experienced Junior Data Entry Representative – Remote Opportunity with careerzynith

    Work from home Full-time role

    Area Sales Manager

    Work from home Full-time role

    [Remote] Part-Time Intake Specialist (Personal Injury)

    Work from home Full-time role

    Medical Records Coordinator

    Work from home Full-time role

    Informatics Policy Analyst

    Work from home Full-time role

    Remote Quantitative Analyst (Finance) - 75403

    Work from home Full-time role

    [Remote] AI Trainer Jobs in the United States

    Work from home Full-time role

    Health Coach (Remote-EST/CST)

    Work from home Full-time role

    Virtual Flight Coordinator - Airline Remote Job

    Work from home Full-time role

    [Remote] Clinical Service Desk Helpdesk Associate (remote) Job Details | NTT DATA Services

    Work from home Full-time role

    Remote Inbound Chat Specialist – Automotive & RV Sales Support – careerzynith – Tallahassee, FL (Work‑From‑Home)

    Work from home Full-time role