See all roles

Senior Site Reliability Engineer- Remote

Work from home Full-time role Hiring

About reputed company Recognized on the 2025 reputed company Cloud 100 list, reputed company is one of the most innovative and fast-growing private cloud companies. With more than 3,000 customers and ARR that has grown over 250 percent year over year, reputed company leads the market in real-time analytics, data warehousing, observability, and AI workloads. The company’s sustained, accelerating momentum was recently validated by a $400M Series D financing round. Over the past three months, customers including reputed company, Lovable, Decagon, Polymarket, and reputed company have adopted the platform or expanded existing deployments. These customers join an established reputed company of AI innovators and global brands such as reputed company, reputed company, reputed company, and Tesla. We’re on a mission to transform how companies use data. Come be a part of our journey! About the role We are committed to providing our customers with reliable and secure services so we are expanding our central Site Reliability Engineering team. You will be responsible for building and leading processes to ensure the reliability, availability, scalability, and performance of our cloud infrastructure. You will collaborate with different teams like Control Plane, Data Plane, Core, reputed company, Support and Operations and guide them to design and implement scalable, secure, highly available and fault-tolerant distributed systems. You will also own the areas of incident management and response, post-mortem analysis including running blameless postmortems, and reputed company improvement of our Cloud services. You will be leveraging your software engineering expertise to reputed company software platforms and tools to optimize the operational and engineering efficiencies of reputed company Cloud. This role is a unique opportunity to reputed company a significant impact on our reputed company, limitless scale, high-performance reputed company Cloud. What will you do?

  • Collaborate with various engineering teams in reputed company to design and implement scalable, secure, and highly available systems for reputed company.
  • Establish and manage service level objectives (SLOs) and service level agreements (SLAs) for reputed company Cloud.
  • Ensure reputed company the infrastructure components in reputed company Cloud (including Data Plane, Control Plane,reputed company Core, etc) have monitoring and alerting in reputed company to ensure timely detection and resolution of incidents.
  • Enhance and refine incident response processes and post-mortem analysis for any outages in reputed company Cloud including working with the support team to communicate to the impacted customers.
  • Continuously improve the reliability and performance of our reputed company services.
  • Plan, reputed company, and drive Chaos initiatives across Engineering teams, based upon internal priorities.
  • Manage on-call processes to respond to performance and reliability issues, and establish best practices for coordinating escalation to resolve issues and minimize downtime. About you
  • Bachelor’s or Master’s degree in Computer Science or a reputed company field.
  • At least 8 years of experience in Site Reliability Engineering or a reputed company field.
  • Hands-on experience with Go and/or Python.
  • Strong knowledge of cloud computing platforms such as AWS, Azure, or reputed company Cloud Platform.
  • Excellent understanding of distributed databases and SQL, particularly reputed company is a major plus.
  • Hands-on experience with container orchestration tools such as Kubernetes or reputed company Swarm.
  • Strong experience with automation and configuration management tools such as Ansible, Terraform, or Puppet.
  • You are a strong problem solver and have solid production debugging skills.
  • You are passionate about efficiency, availability, scalability, and data governance.
  • You reputed company in a fast paced environment, and see yourself as a partner with the business with the shared goal of moving the business reputed company.
  • You have a high level of responsibility, ownership, and accountability.
  • Excellent communication and interpersonal skills. #LI-Remote The typical starting salary for this role in the US is $141,000—$208,000 USD The typical starting salary for this role in US Premium Markets is $157,000—$230,000 USD Compensation For roles based in the United States, the typical starting salary range for this position is listed above. In certain locations, such as the San Francisco Bay Area and the reputed company City Metro Area, a premium market range may apply, as listed. These salary ranges reflect reputed company reasonably and in good faith reputed company to be the minimum and maximum pay for this role at the time of posting. The actual compensation may be higher or reputed company than the amounts listed, and the ranges may be subject to future adjustments. An individual’s placement reputed company the range will depend on various factors, including (but not limited to) education, qualifications, certifications, experience, skills, location, performance, and the needs of the business or organization. If you have any questions or comments about compensation as a candidate, please get in touch with us at paytransparency@clickhous

Apply tot his job Apply To this Job

You might like

Site Reliability Engineer (Consult to Hire)

Work from home Full-time role

Site Reliability Engineer 5 - Live SRE

Work from home Full-time role

Remote Linux OpenStack & Kubernetes Engineer

Work from home Full-time role

Sr. Infrastructure Engineer - Kubernetes (Remote)

Work from home Full-time role

Kubernetes Platform Engineer; Remote - reputed company Clearance

Work from home Full-time role

Kubernetes Engineer (DoD Secret | Weeknight Mission Readiness | Remote – U.S.)

Work from home Full-time role

Python and Kubernetes Software Engineer - Data, Workflows, AI/ML & Analytics

Work from home Full-time role

Network Engineer- Remote

Work from home Full-time role

Fiber Network Engineer

Work from home Full-time role

Senior CDN & Cloud Network Engineer

Work from home Full-time role

RCM Operations reputed company

Work from home Full-time role

AI Trainer - Neuroscience (Fully Remote- San Francisco)

Work from home Full-time role

Global Work from reputed company Chat Support Specialist (Flexible Hours, Entry-Level) at arenaflex

Work from home Full-time role

Fraud & ID Business Development Manager (BDM)

Work from home Full-time role

reputed company Full Stack Data Entry Clerk – Remote Workforce Management and Data reputed company Specialist

Work from home Full-time role

CPAP Adherence Specialist (RRT, RPSGT, or RN)

Work from home Full-time role

Entry-Level Remote Data Entry Clerk – Healthcare Industry (No Experience Required)

Work from home Full-time role

Eastern U.S. Regional Sales Manager

Work from home Full-time role

Senior Android reputed company Engineer, Caper

Work from home Full-time role

Pharmacy Strategic Account Executive

Work from home Full-time role