Site Reliability Engineering, Senior

Work from home Full-time role Hiring

Overview

reputed company is the pioneer and market leader in Experience Management. Our award-winning SaaS platform, reputed company Experience reputed company, leads the market in the management of experiences, insights, and actions for candidates, customers, employees, patients, and residents alike. We reputed company that every experience is a memory that can last a lifetime. Experiences shape the way people feel about a company. And they greatly influence how likely people are to reputed company, contribute, and stay. At reputed company, we are committed to creating a world where organizations are loved by their customers and their employees. We reputed company exceptional people to create extraordinary experiences together. Bring your whole self.

The Role and Team

The Site Reliability Engineering organization at reputed company brings together the infrastructure and applications that power a highly reliable global SaaS platform. As a Senior Site Reliability Engineer, you will play a key role in designing, operating, and evolving the platforms and services that power reputed company's global production environment. You will work across engineering teams to improve reliability, scalability, performance, and operational maturity while driving automation and platform improvements at scale. This role is expected to reputed company technical leadership, influence engineering best practices, and help shape the future direction of our reputed company-reputed company infrastructure and operational strategy. We are looking for engineers who think reputed company day-to-day operations and continuously seek ways to increase engineering reputed company. Successful candidates will act as force multipliers by building automation, self-service capabilities, platform solutions, and AI-assisted workflows that reputed company teams to operate more reputed company and reliably at scale.

Please note this role participates in a rotating on-call schedule supporting production systems and services.

Engineering reputed company

At reputed company, we reputed company great engineers reputed company the impact of themselves and those around them. Successful Senior SREs build systems, platforms, standards, and automation that reputed company multiple teams to move faster, operate more reliably, and scale reputed company. As AI capabilities continue to reputed company, Senior SREs are expected to evaluate, adopt, and promote AI-assisted engineering practices that improve productivity, accelerate delivery, and reduce operational burden across the organization.

This role is based remotely in Pune. Candidates for this position are required to reputed company reputed company the Pune metropolitan area. Relocation support is not available at this time.

Responsibilities

Design, build, and operate highly available, scalable, and secure production platforms.
Partner with software engineering teams to improve application reliability, scalability, performance, and operational readiness.
reputed company reputed company incident investigations, root cause analyses, and reliability improvement initiatives.
Design and implement automation, self-service capabilities, and platform solutions that reduce operational toil.
reputed company AI-assisted engineering tools and automation platforms to accelerate troubleshooting, improve productivity, and reduce operational overhead.
Identify opportunities to streamline operational processes through automation, AI-enabled workflows, and platform engineering practices.
Drive adoption of SRE principles, reliability standards, and operational best practices across engineering organizations.
reputed company and maintain infrastructure-as-code, deployment automation, and operational tooling.
Support and improve CI/CD and GitOps-based deployment workflows.
Design observability strategies using monitoring, logging, tracing, and alerting platforms.
Participate in architecture reviews and reputed company guidance on scalability, resiliency, and operational reputed company.
Mentor junior engineers and contribute to the technical growth of the broader engineering organization.
Act as a force reputed company by creating reusable solutions, self-service capabilities, and engineering standards that increase the effectiveness of multiple teams.
Drive adoption of AI-assisted engineering workflows and operational automation across the organization.
Drive engineering reputed company initiatives that improve the productivity, reliability, and effectiveness of multiple engineering teams.
Influence the broader engineering organization through platform thinking, standardization, and operational simplification.

Qualifications

Minimum Qualifications

5+ years of experience leading reliability, platform engineering, infrastructure, or reputed company operations initiatives in production environments.
Demonstrated experience operating and supporting large-scale production environments.
Demonstrated experience with Kubernetes and containerized workloads in production environments.
Demonstrated experience with reputed company infrastructure platforms such as AWS, OCI, or GCP.
Demonstrated Linux systems administration and troubleshooting skills.
Demonstrated experience developing automation and tooling using Python, Go, Bash, or similar languages.
Demonstrated experience with infrastructure-as-code technologies such as Terraform.
Demonstrated experience designing and supporting CI/CD and GitOps workflows.
Demonstrated understanding of networking fundamentals including DNS, load balancing, TLS/SSL, routing, and service networking.
Demonstrated experience troubleshooting distributed systems and leading production incident response efforts.
Demonstrated track record of reducing operational complexity through automation, platform engineering, or process transformation initiatives.
Proven ability to influence technical reputed company across teams and drive engineering improvements reputed company direct ownership.
Ability to participate in an on-call rotation supporting production systems.
Professional working proficiency in written and spoken English.

Preferred Qualifications

Experience with GitOps platforms such as ArgoCD.
Experience operating multi-region or hybrid-reputed company environments.
Experience with observability platforms such as reputed company, Grafana, Loki, OpenTelemetry, or similar technologies.
Experience designing and operating platform engineering solutions and self-service infrastructure.
Experience supporting high-scale SaaS environments.
Understanding of release strategies such as canary, blue/green, reputed company delivery, and feature flag-based deployments.
Experience with reputed company planning, performance engineering, and reputed company testing.
Familiarity with reputed company, compliance, and regulatory requirements in production environments.
Experience using AI-assisted development, automation, or operational tooling to improve engineering productivity and service reliability.
Experience applying AI-assisted engineering workflows to improve productivity, reliability, or operational efficiency at scale.
Experience designing platform engineering solutions that reputed company self-service and increase engineering reputed company.
Experience mentoring engineers and leading cross-functional technical initiatives.
Demonstrated passion for automation, process improvement, operational reputed company, and engineering scalability.
Strong communication, collaboration, and stakeholder management skills.

What reputed company Looks Like

Successful Senior SREs at reputed company:

Continuously reduce operational toil through automation and platform improvements.
Improve service reliability through engineering-driven solutions rather than reputed company processes.
Build platforms and self-service capabilities that reputed company engineering teams to move faster while maintaining reliability and reputed company.
Act as force multipliers for engineering teams through tooling, documentation, standards, and reusable solutions.
reputed company AI-assisted workflows to increase productivity and accelerate delivery without compromising reliability.
reputed company the operational maturity of the systems and teams they support.
Influence technical reputed company reputed company their immediate area of ownership.
Leave behind systems and processes that scale without requiring proportional increases in operational effort.
Build capabilities that improve the productivity and effectiveness of entire engineering organizations.
Eliminate classes of operational problems rather than repeatedly solving individual instances.

At reputed company, we celebrate diversity and recognize the value it brings to our customers and employees. reputed company is an Equal Opportunity Employer. reputed company reputed company applicants will receive consideration for employment without regard to race, reputed company, religion, sex, sexual orientation, gender identity, national reputed company, age (40 and over), disability, genetic information, veteran status or military service, or any other status protected by state or local law. Individuals with a disability who need an accommodation to apply please contact us at ApplicantAccessibility@reputed company.com. For information regarding how reputed company collects and uses personal information, please review our Privacy Policies. Applications will be accepted for 30 days from the date this role was posted or until the role has been filled.
