See all roles

Staff Site Reliability Engineer

Work from home Full-time role Hiring

Caseware is one of Canada's original Fintech companies, having led the global audit and accounting software industry for over 30 years, with more than 500,000 users across 130 countries and available in 16 different languages. While you might not have heard of us (yet) over 36,000 accounting and audit professionals list Caseware as a skill on their LinkedIn profiles! We are seeking an experienced Site Reliability Engineer with solid software engineering skills and practical knowledge of operating modern cloud‑native infrastructure. In this position, you will contribute to building and scaling our AI platform by ensuring our systems on AWS, Kubernetes, and GitOps workflows are reliable, observable, and automated. The ideal candidate will have advanced technical expertise, excellent communication abilities, and a talent for collaborating well with engineering teams. Location: This is a fully remote position located in Romania. You will be reporting to: Amir Toole Contact: Dana Liulica - Senior Talent Acquisition Partner We are seeking an experienced Site Reliability Engineer with solid software engineering skills and practical knowledge of operating modern cloud‑native infrastructure. In this position, you will contribute to building and scaling our AI platform by ensuring our systems on AWS, Kubernetes, and GitOps workflows are reliable, observable, and automated. The ideal candidate will have advanced technical expertise, excellent communication abilities, and a talent for collaborating well with engineering teams. Location: This is a fully remote position located in Romania. You will be reporting to: Amir Toole Contact: Dana Liulica - Senior Talent Acquisition Partner We are seeking an experienced Site Reliability Engineer with solid software engineering skills and practical knowledge of operating modern cloud‑native infrastructure. In this position, you will contribute to building and scaling our AI platform by ensuring our systems on AWS, Kubernetes, and GitOps workflows are reliable, observable, and automated. The ideal candidate will have advanced technical expertise, excellent communication abilities, and a talent for collaborating well with engineering teams. Location: This is a fully remote position located in Romania. You will be reporting to: Amir Toole Contact: Dana Liulica - Senior Talent Acquisition Partner

Key Responsibilities

  • Maintain reliable, high‑performing AWS production systems.
  • Manage EKS clusters for configuration, scaling, and workload stability.
  • Set up and support Istio service mesh for traffic control and security.
  • Oversee GitOps workflows to ensure secure, consistent infrastructure changes.
  • Create automation tools and platform enhancements.
  • Design, implement, and manage monitoring, logging, and tracing solutions across a diverse range of applications—including AI workloads, microservices, and data pipelines—to ensure visibility, reliability, and rapid issue resolution.
  • Respond to incidents, analyze root causes, and recommend lasting solutions.
  • Work with developers and platform teams to enhance deployments and system operations.
  • Support nx‑based monorepos for scalable, effective developer workflows.
  • On call rotation

Technical Skills

  • Deep understanding of AWS services commonly used in production (EKS, EC2, IAM, networking, load balancing, etc.).
  • Professional experience with Kubernetes (EKS), including workload autoscaling, networking, RBAC, and cluster operations.
  • Hands‑on experience with service meshes, specifically Istio.
  • Expertise with GitHub, GitHub Actions, and modern CI/CD workflows.
  • Experience working with monorepos, especially nx.
  • Understanding of GitOps practices (we use Flux CD).
  • Strong grasp of Linux s

Apply tot his job Apply To this Job

You might like

Software Engineer (Python + Kubernetes)

Work from home Full-time role

Senior Systems Software Engineer, Containers and Kubernetes

Work from home Full-time role

Kubernetes Networking Platform Engineer :: Bethesda, MD (Remote)

Work from home Full-time role

Senior DevOps Engineer - Kubernetes Focused (Hub-Remote: DC or Philly Metro)

Work from home Full-time role

Senior Software Engineer, Managed Orchestration (Managed Kubernetes)

Work from home Full-time role

Forward Deployed Engineer, AI Inference (vLLM and Kubernetes)

Work from home Full-time role

Java Engineer Level III - AWS , Kafka, Kubernetes (MEXICO ONLY)

Work from home Full-time role

Ranchester Kubernetes Engineer; USC or GC W2

Work from home Full-time role

Principal Kubernetes GPU Infrastructure Engineer

Work from home Full-time role

Senior Engineer II, Managed Kubernetes

Work from home Full-time role

Experienced Data Entry Image Review Specialist – arenaflex – Tampa, FL

Work from home Full-time role

Email Marketing Manager (Remote First)

Work from home Full-time role

Market Entry Specialist - AI Trainer - Freelance - 8-20hrs/week - Remote

Work from home Full-time role

E-Commerce Analyst - with an Amazon Focus

Work from home Full-time role

Payer Contracting Data Analyst

Work from home Full-time role

Experienced Remote Live Chat Agent – Customer Service Representative for arenaflex

Work from home Full-time role

Experienced EAP Worklife Customer Support Associate – Delivering Exceptional Service and Promoting Employee Well-being at arenaflex

Work from home Full-time role

Experienced Entry-Level Data Entry Clerk – Remote Opportunity at arenaflex

Work from home Full-time role

Regional Medical Scientific Director (Medical Science Liaison) - Ophthalmology (Nor CA)

Work from home Full-time role

Experienced Healthcare Customer Service Representative – Virtual Team Environment

Work from home Full-time role