[Remote] Staff Site Reliability Engineer

Remote · India · Full-time

Note: This is a remote position open to candidates in the USA. Blink Health is the fastest-growing healthcare technology company, building products that make prescriptions accessible and affordable to everybody. The Staff Site Reliability Engineer will establish best practices for reliability, define observability strategies, and act as a technical leader to improve platform resilience and operational maturity.

Responsibilities

  • Establish and evolve SRE best practices across the organization, including reliability principles, error budgets, incident response, postmortems, and operational readiness standards
  • Define and drive observability strategy for system health, performance, and reliability, including SLIs/SLOs, alerting quality, dashboards, and service health indicators
  • Design and implement software-driven solutions within the infrastructure domain, automating manual processes and eliminating operational complexity and toil
  • Act as a technical leader and force multiplier, helping set priorities and influencing decision-making across core cloud infrastructure, reliability tooling, and platform architecture
  • Take ownership of large, ambiguous initiatives, driving them from concept to delivery while aligning stakeholders across engineering, security, and product
  • Combine deep knowledge of software development, infrastructure, and security to improve platform resilience, scalability, performance, and compliance
  • Proactively identify systemic risks and reliability gaps, recommending and leading platform upgrades and architectural improvements before they become incidents
  • Partner with engineering teams to improve developer workflows, tooling, and operational maturity, increasing productivity while reducing cognitive load
  • Provide technical mentorship, architecture guidance, and high-quality design and code reviews for engineers across infrastructure and product teams
  • Lead by example in documentation and knowledge sharing, ensuring systems and processes are well-understood and not dependent on individual ownership
  • Participate in and help mature incident response, escalation practices, and post-incident learning across the organization

Skills

  • Bachelor's or Master's degree in Computer Science or equivalent practical experience
  • 7+ years of experience in site reliability engineering, infrastructure engineering, or platform engineering roles, with demonstrated impact at scale
  • Expert-level, methodical troubleshooting across the entire stack, from application to kernel to network
  • Strong command-line proficiency and deep expertise in Linux systems and operating system fundamentals
  • Advanced understanding of networking concepts including load balancing, proxies, DNS, TCP/IP, NAT, and service-to-service communication
  • Experience working across multiple languages (e.g., Python, Go, Bash) and familiarity with troubleshooting application stacks such as React or similar
  • Strong track record of automating repetitive and complex operational work to reduce toil and increase reliability
  • Ability to design and build internal tools (Python or Go) that standardize and scale engineering practices
  • Comfortable operating in an agile environment, with disciplined testing and quality practices
  • Deep experience with cloud platforms, particularly managed services and production-grade architectures
  • Strong expertise in Kubernetes and container orchestration (EKS, Helm), including lifecycle management and operational best practices
  • Proven experience designing and implementing observability systems, including metrics, logging, tracing, dashboards, and alerting
  • Deep understanding of container technologies, security scanning, secrets management, dynamic configuration, and microservices architectures
  • Experience designing and maintaining company-wide IaC codebases using tools such as Terraform, Pulumi, CloudFormation, or Ansible
  • Ability to think holistically about infrastructure design, cost, reliability, security, and long-term maintainability
  • Cloud platform experience: AWS preferred; GCP or Azure acceptable
  • Familiarity with service meshes and advanced traffic management concepts

Company Overview

  • BlinkRx is a prescription access platform that connects patients to branded medications, with transparent pricing and home delivery. It was founded in 2014 and is headquartered in New York, New York, USA, with a workforce of 1001-5000 employees. Its website is https://blinkhealth.com.