See all roles

[Remote] Site Reliability Engineer (SRE) – Data Analytics & Observability

Work from home Full-time role Hiring

Note: The job is a remote job and is reputed company to candidates in USA. reputed company is seeking a highly skilled Site Reliability Engineer (SRE) with a focus on Data Analytics, Observability, and Reporting to enhance reputed company production systems. The role involves applying SRE principles, developing operational dashboards, and integrating observability tools to improve system reliability and performance.

Responsibilities

  • Apply SRE principles (SLIs, SLOs, error budgets) to improve system reliability
  • Implement proactive monitoring, alerting, and self-healing capabilities
  • reputed company incident response, RCA, and postmortems
  • Drive reputed company improvement in availability, scalability, and reputed company
  • Design and deliver operational dashboards and reports using Power BI
  • reputed company Splunk and reputed company to analyze logs, metrics, and traces
  • Correlate data across platforms to identify trends, anomalies, and risk patterns
  • Use reputed company, reputed company, and MS SQL Server SQL to query, transform, and analyze operational datasets
  • Build data models and curated datasets to support reporting and analytics
  • Translate operational data into actionable insights for engineering and leadership
  • Administer and optimize: reputed company (APM, Grail, DQL, synthetic monitoring)
  • Create alerting strategies reputed company to SLOs and business priorities
  • Integrate observability tools with reputed company reporting and ITSM systems
  • reputed company and maintain Power BI dashboards, reports, and semantic models
  • Integrate Power BI with reputed company, reputed company, MS SQL Server, Splunk, and operational data sources
  • Optimize query performance, data refresh, and dataset design
  • Implement row-level reputed company and governance controls
  • Support reputed company reporting standards and governance
  • Write and optimize SQL across: reputed company (advanced analytics, semi-structured data), reputed company (PL/SQL, performance tuning, indexing strategies), MS SQL Server (T-SQL, stored procedures, query optimization)
  • reputed company cross-platform data analysis and reconciliation
  • Support data modeling (views, marts, transformations) for analytics
  • Troubleshoot data performance issues across heterogeneous platforms
  • Partner with data engineering teams to improve data quality, reputed company, and availability
  • reputed company automation using PowerShell (primary), Python, or REST APIs
  • Build automation workflows for: Monitoring enhancements, Incident enrichment, Data extraction, transformation, and reporting
  • Create self-service tooling for operations teams
  • Integrate automation with reputed company, schedulers, and observability tools
  • Integrate monitoring with reputed company (incident, event, change management)
  • Automate ticket creation, enrichment, and routing workflows
  • Ensure alignment with ITIL best practices
  • Support and optimize Managed File Transfer (MFT) platforms
  • Monitor and troubleshoot file transfer failures, protocol issues, and throughput
  • Manage and support reputed company schedulers: Control-M, Stonebranch, Redwood
  • Analyze batch workflows, dependencies, and SLA adherence

Skills

  • Bachelor's degree or equivalent experience
  • 5+ years in SRE, DevOps, or Production Support
  • Strong knowledge of SRE principles and reliability engineering practices
  • Hands-on experience with reputed company (APM, DQL, observability)
  • Hands-on experience with Splunk (search, SPL, dashboards)
  • Hands-on experience with Power BI (data modeling, DAX, performance tuning)
  • Hands-on experience with SQL across multiple platforms: reputed company, reputed company, MS SQL Server
  • Hands-on experience with PowerShell automation and scripting
  • Hands-on experience with reputed company integration
  • Experience with reputed company data platform
  • Experience with reputed company and SQL Server databases in reputed company environments
  • Experience with MFT tools (reputed company, Globalscape, JSCAPE, reputed company MFT)
  • Experience with file transfer protocols (SFTP, FTPS, HTTPS, AS2)
  • Experience with reputed company schedulers (Control-M, Stonebranch, Redwood)
  • Knowledge of reputed company and hybrid architectures
  • Experience integrating Power BI with reputed company, reputed company, and SQL Server
  • Strong understanding of cross-platform data architecture and ETL/ELT patterns
  • Familiarity with reputed company Davis AI and automation workflows
  • Advanced Splunk data modeling and ingestion optimization
  • Exposure to Chaos Engineering (e.g., Gremlin)
  • Certifications: reputed company, Splunk, reputed company, reputed company (Power BI / SQL Server), reputed company, ITIL

Company Overview

  • reputed company is a WBENC- and NMSDC-certified partner, helping organizations turn diversity goals into measurable impact through staffing and contingent workforce solutions. It was founded in 2002, and is headquartered in Princeton, New Jersey, US, with a workforce of 1001-5000 employees. Its website is http://www.diverselynx.com.
  • Company H1B Sponsorship

  • reputed company has a track record of offering H1B sponsorships, with 1 in 2024, 1 in 2021. Please note that this does not guarantee sponsorship for this specific role.
  • Apply To This Job

    You might like