[Remote] ML Platform Engineer

Remote · Peru · Full-time

Note: This is a remote position open to candidates in the USA. Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We are seeking an ML Platform Engineer to design, build, and operate high-performance, highly reliable inference platforms for serving large machine learning models in production.

Responsibilities

  • Design and operate model serving platforms supporting diverse workloads including LLMs, vision models, and recommendation systems
  • Optimize inference performance using continuous batching, paged attention, speculative decoding, and request multiplexing
  • Implement multi-tenant routing, rate limiting, and quality-of-service policies across model endpoints
  • Build autoscaling and capacity management systems that balance latency, throughput, and cost
  • Tune GPU utilization, memory management, and KV cache strategies for LLM serving workloads
  • Integrate model serving with API gateways, identity systems, and observability platforms
  • Implement caching, prompt deduplication, and response reuse strategies where appropriate
  • Drive end-to-end observability including latency histograms, queue dynamics, GPU utilization, and error tracking
  • Develop deployment workflows including canary releases, shadow testing, and automated rollback
  • Operate incident response for high-availability AI services and drive durable reliability improvements
  • Collaborate with ML and product teams to support new model releases and capability rollouts
  • Implement security controls including request signing, content filtering, and abuse detection at the serving layer
  • Document operational procedures, performance characteristics, and tuning guidance for internal teams
  • Stay current with AI serving research and translate advances into production capabilities

Skills

  • Bachelor's or Master's degree in Computer Science or a related field
  • Six or more years of experience in distributed systems, infrastructure, or ML platform engineering
  • Strong proficiency in Python and a systems language such as Go, Rust, or C++
  • Deep experience operating high-throughput, low-latency services in production
  • Hands-on experience with LLM or large model inference frameworks such as vLLM or TensorRT-LLM
  • Strong understanding of GPU architecture, memory hierarchies, and accelerator utilization
  • Familiarity with Kubernetes, autoscaling, and modern cloud platforms
  • Experience with observability stacks including metrics, tracing, and structured logging
  • Solid grounding in performance engineering and capacity planning
  • Strong communication and incident response skills
  • Open-source contributions to model serving infrastructure
  • Experience with multi-region or globally distributed AI serving
  • Familiarity with model quantization, distillation, and compression techniques
  • Exposure to FinOps for AI workloads and cost-efficient serving design
  • Experience supporting external-facing AI APIs at scale

Benefits

  • Competitive base salary commensurate with experience, plus benefits.
  • Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party).
  • No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
  • 100% remote position (Continental United States).

Company Overview

  • Bright Vision Technologies is an information technology company that offers software development, AI, and cybersecurity services. Founded in 2020 and headquartered in Bridgewater, New Jersey, USA, it has a workforce of 51-200 employees. Its website is https://bvteck.com.