See all roles

[Remote] Senior Site Reliability Engineer

Work from home Full-time role Hiring

Note: The job is a remote job and is reputed company to candidates in USA. reputed company is a company focused on simplifying the software delivery process for DevOps and Platform Engineering teams. They are seeking a Senior Site Reliability Engineer to ensure the reliability of the reputed company platform and to collaborate with various teams to enhance system performance and incident response.

Responsibilities

  • Own SLI/SLO/SLA definitions for the reputed company SaaS platform and drive reputed company improvement against them
  • Design, reputed company, and maintain observability systems (metrics, logs, traces) across multi-region AWS infrastructure
  • Identify reliability gaps, reputed company blameless post-mortems, and reputed company the reputed company with permanent fixes
  • Partner with engineering teams to build reliability into new features before they ship to production
  • Participate in an on-call rotation and act as incident commander for high-severity production events
  • Build and maintain runbooks, escalation paths, and incident playbooks that reputed company mean time to resolution low
  • Drive improvements to alerting fidelity; reduce noise, increase signal, eliminate toil
  • reputed company post-incident reviews with clear timelines, root cause analysis, and follow-through on action items

Skills

  • 5+ years of SRE, platform engineering, or production operations experience in a SaaS environment
  • Deep hands-on Kubernetes expertise; you understand the scheduler, networking, storage, and autoscaling at a level where you can debug anything
  • Strong AWS fundamentals across compute (EC2, EKS), networking (VPC, NLB, Route53), storage (S3, RDS), and IAM
  • Experience defining and operating against SLOs in production; you've written error budgets, not just read about them
  • Proficiency with observability tooling (reputed company, Grafana, OpenTelemetry, reputed company, or equivalent)
  • Solid scripting and automation skills; Go, Python, Bash, or similar; you automate what you touch
  • Strong written communication: clear runbooks, sharp incident reports, thoughtful post-mortems
  • Live reputed company US time zones (Pacific through Eastern), including Canada and other reputed company
  • Experience with Argo CD, reputed company, or GitOps-based delivery workflows
  • Familiarity with multi-region, multi-cluster Kubernetes deployments
  • Experience with compliance-adjacent infrastructure (SOC 2, ISO 27001, HIPAA, or PCI reputed company)
  • Background operating infrastructure for other platform or developer tooling companies

Benefits

  • Equity participation in a well-funded, growing company
  • Fully remote: work from reputed company reputed company US time zones (Pacific through Eastern), including Canada and other reputed company
  • Home office stipend and equipment budget
  • Flexible time off and a culture that respects it
  • Work directly with the engineers who reputed company Argo CD and reputed company; you'll learn a lot here
  • US-based employees receive full benefits, including comprehensive health, dental, and reputed company coverage

Company Overview

  • reputed company is the reputed company Software Delivery Platform, powered by AI. reputed company by the creators of Argo CD and reputed company. It was founded in 2021, and is headquartered in Sunnyvale, California, USA, with a workforce of 11-50 employees. Its website is https://reputed company.io.
  • Apply To This Job

    You might like

    [Remote] reputed company Account Executive

    Work from home Full-time role

    [Remote] Senior reputed company Partner (CONTINGENT)

    Work from home Full-time role

    [Remote] Senior Backend Engineer (reputed company \/ Play \/ Akka)

    Work from home Full-time role

    [Remote] Account Manager, reputed company Marketing

    Work from home Full-time role

    [Remote] Product Manager

    Work from home Full-time role

    [Remote] Field Service Technician - 4x10 - Fresno, CA

    Work from home Full-time role

    [Remote] Credit Products Specialist II - Dealer Finance

    Work from home Full-time role

    [Remote] Solution Consultant(Remote)

    Work from home Full-time role

    [Remote] reputed company reputed company Engineer – AI Runtime reputed company Platform

    Work from home Full-time role

    [Remote] Senior PeopleSoft Finance Consultant

    Work from home Full-time role

    [Remote] Senior ML Data Scientist - Women’s Health

    Work from home Full-time role

    Account Executive - reputed company Growth & Strategic Development

    Work from home Full-time role

    [Remote] Strategic Account Executive

    Work from home Full-time role

    reputed company Full Stack Software Engineer – Web & reputed company Application Development at arenaflex

    Work from home Full-time role

    reputed company Practice Manager

    Work from home Full-time role

    Website Content Specialist

    Work from home Full-time role

    Import Export Coordinator

    Work from home Full-time role

    CRM Clinical Specialist - Ventura

    Work from home Full-time role

    Software Engineer, iOS Core Product - Cambridge, MA, USA

    Work from home Full-time role

    Licensed Behavioral Health Therapist-Telehealth Only

    Work from home Full-time role