All roles

Open role

[Remote] Senior Site Reliability Engineer

Remote · Thailand Full-time

Note: The job is a remote job and is open to candidates in USA. ARA is a company focused on Information Technology, and they are seeking a Senior Site Reliability Engineer. The role involves partnering with development and IT teams to enhance system operability and support, while also maintaining operational standards and improving platform stability.

Responsibilities

  • Partner with software developers, platform engineers, and IT staff to improve system design, operability, deployment safety, and production support readiness
  • Define and maintain operational standards, runbooks, support procedures, escalation paths, and service-level objectives
  • Evaluate system architecture and changes to ensure they balance functional requirements, service quality, reliability, security, and compliance needs
  • Drive continuous improvement in platform stability, maintenance, and availability
  • Provide advanced technical support and troubleshooting for complex platform and service issues affecting internal users and stakeholders

Skills

  • 8+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, Systems Engineering, or related infrastructure roles supporting production services
  • Strong experience with Linux systems administration and troubleshooting in enterprise environments
  • Strong experience operating and maintaining on-prem Kubernetes platforms and all related components including CRI, CNI, and CSI plugins
  • Experience deploying and maintaining applications on Kubernetes using Helm, Kustomize, and similar tooling
  • Experience supporting DevOps tooling such as GitLab, Artifactory, Jira, Confluence
  • Experience with GitOps tools such as FluxCD or ArgoCD
  • Proficiency scripting with at least one of Python, Go, or Bash
  • Strong experience designing, maintaining, and maturing observability tooling including monitoring, dashboards, logging and tracing, and supporting SLOs
  • Strong understanding of reliability engineering concepts: Service health indicators, High availability design, failure reduction, and testing, Operational readiness practices, including developing documentation, runbooks, and architectural descriptions, Incident response, root cause analysis, remediation/recovery
  • Ability to obtain a security clearance, which includes U.S. citizenship
  • Bachelor's degree in CS, Software Engineering or other IT-related field or equivalent experience
  • Experience with multiple Linux distributions including Ubuntu
  • Experience with at least one of the following: Tanzu Kubernetes, Nutanix Kubernetes Platform, Canonical Kubernetes
  • Experience with cloud platforms such as AWS and Azure
  • Experience with infrastructure automation and configuration management
  • Experience managing AI tooling on Kubernetes including MCP Servers, LLM platforms (vLLM, Ollama), Kubeflow
  • Experience with security and compliance considerations in regulated environments
  • DoD experience
  • Active or inactive Secret Security Clearance

Company Overview

  • ARA provides research, engineering, and technical support services. It was founded in 1979, and is headquartered in Albuquerque, New Mexico, USA, with a workforce of 1001-5000 employees. Its website is https://www.ara.com.
  • More open positions

    [Remote] Account Sales Manager - Pennsylvania

    Work from home Full-time role

    [Remote] Senior Developer Advocate Engineer - Robotics and Physical AI

    Work from home Full-time role

    [Remote] Lead Web Platform Engineer

    Work from home Full-time role

    [Remote] Cloud Operations Engineer Job Details | Capgemini

    Work from home Full-time role

    [Remote] Senior Business Analyst

    Work from home Full-time role

    Senior Product Designer (Design Systems)

    Work from home Full-time role

    Remote Hotel Reservations Agent - Work From Home Opportunity

    Work from home Full-time role

    [Remote] Director Clinical Development, Solid Tumors, GI

    Work from home Full-time role

    RN Health Coord (bilingual, remote, temporary)

    Work from home Full-time role

    Licensed Medicare Agent

    Work from home Full-time role

    Senior Customer Support Engineer – Hybrid (2 Remote / 3 On‑Site) – Advanced Linux & Networking Troubleshooting – careerzynith – $130K+ Compensation

    Work from home Full-time role

    Head of People Operations

    Work from home Full-time role

    Senior Registry Manager - CorEvitas

    Work from home Full-time role

    [Remote] Senior Agentic AI Engineer

    Work from home Full-time role

    Software Engineer, iOS Core Product - Abuja, Nigeria

    Work from home Full-time role

    [Remote] Project Manager II

    Work from home Full-time role

    Remote Data Entry Assistant – Entry-Level Data Management & Quality Assurance Role at careerzynith (No Experience Required)

    Work from home Full-time role

    FHCS Merchandiser

    Work from home Full-time role

    [Remote] Advanced Vestas Travel Troubleshooting Technician

    Work from home Full-time role

    Remote Dental & Billing

    Work from home Full-time role

    Remote Customer Service Representative – United States (Work‑From‑Home) – careerzynith Global E‑Commerce & Technology Leader

    Work from home Full-time role