See all roles

Principal Site Reliability Engineer - Cloud (Remote)

Work from home Full-time role Hiring

Join a dynamic team at the pulse of global markets, where we deliver innovative software and service solutions for essential financial reporting and capital markets transactions. At DFIN, we are a values-driven organization that empowers you to build a fulfilling career while bringing your authentic self to work every day. Our “Win as One” mentality ensures that our team’s success is directly linked to Client, Shareholder and Employee Satisfaction. Recognized by Newsweek as one of AMERICA’S MOST LOVED WORKPLACES® for three consecutive years and a Built In Best Places to Work for six years, we are committed to our employees’ total wellbeing. Enjoy competitive compensation, a flexible workplace, comprehensive benefits, and opportunities for professional growth. Bring your passion and talents to DFIN – because being YOU thrives here. Summary: We are looking for technical team members at all levels who want to push themselves to deliver best in market SaaS solutions. We offer a challenging environment where you will have to grow, adapt and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise. The Principal Site Reliability Engineer is responsible for ensuring our SaaS products are fast, stable and optimized for our customers. SRE’s at DFIN take on availability, performance, managing change, monitoring, response and are guardians of non-functional requirements. You either have an SaaS infrastructure background with a programmatic, automated mindset or are someone that comes with a software engineering background with SaaS infrastructure experience. The SRE goal is to build automated systems that reduce or eliminate manual work to keep our products up and running and performing optimally. We are looking for someone who thrives on collaboration within the team and across other groups and can lead colleagues independently to deliver solutions to complex problems. Responsibilities: • Champion and implement a culture of SRE to maintain a high-quality platform infrastructure in DFIN SaaS products • Champion and implement application and infrastructure monitoring and alerting to prevent client impacting issues by ensuring system availability, performance and scalability to maintain SLOs and SLAs • Optimize application performance at scale • Automate everything including system operational runbooks • Define and support continuous integration and deployment pipelines (CI/CD) aligned to branching and quality assurance strategies • Dive deep into technology and stay on the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into work processes • Perform with broad independence and deliver on project milestones and tasks on schedule while communicating progress regularly • Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations • Learn continuously and apply lessons learned • Evangelize best practices, eliminate bottlenecks, and improve process • Participate in on-call duties 365/24/7 and lead the triage and RCA of production incidents Qualifications: • 8+ years experience writing software in any modern software language such as C# .NET, Java • 8+ years experience creating automated deployments with tools such as Harness, Azure DevOps, Ansible or Jenkins to manage Infrastructure as Code and software build and deployment in a continuous integration (CI) / continuous delivery (CD) environment • 5+ years experience as a global admin of Azure including cloud cost management • 5+ years experience implementing production performance, availability, and scalability monitoring and alerting using a tool such as New Relic, Dynatrace, DataDog or AppDynamics • 5+ years experience writing scripts in PowerShell or Python/Bash to automate system operations as runbooks for Windows or Linux environments. • 5+ years experience supporting public client facing revenue generating systems • Strong DevOps focus and experience building and deploying Infrastructure as Code with Terraform or similar technology • Experiencing monitoring and preventing issues with databases and database queries (SQL, Cosmos) using tools like Solarwinds Database Performance Analyzer, Idera SQL Diagnostic Manager, or Redgate SQL Monitor • Experience planning, coordinating, developing and executing all stages of post deployment verification test scripts • Experience securing Windows or Linux systems in 24x7 production environment • Experience with containerization and managing Kubernetes clusters (AKS or EKS) • Experience with common cloud networking, firewall and load balancing configuration • BS in Computer Science or equivalent work experience. It is the policy of Donnelley Financial Solutions to select, place, and manage all its employees without discrimination based on race, color, national origin, gender, age, religion, actual or perceived disability, veteran's status, actual or perceived sexual orientation, genetic information or any other protected status. If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access jobs.dfinsolutions.com as a result of your disability. You can request a reasonable accommodation by sending an email to [email protected]. At DFIN, protecting your identity is a top priority. Please be aware of scammers impersonating DFIN recruiters. DFIN recruiters will never request personal information via email or text. You will only receive a text from us if you've already been in contact. All automated messages will come from [email protected]. If you ever have doubts about the legitimacy of any communication from us, please do not hesitate to reach out for verification via [email protected] (this email is for general TA questions and is not used for updates on your application status). Apply Job!

You might like

Part-Time Student - Software Engineer - Champaign, IL

Work from home Full-time role

Campbell Global Forester

Work from home Full-time role

Remote Life Insurance Agent

Work from home Full-time role

Business Relationship Support Representative - CSO 2 WellsOne

Work from home Full-time role

Part-Time Universal Banker (20 Hours)- Bilingual English/Spanish Preferred- Biscayne Branch

Work from home Full-time role

Commercial Card Modernization Product Director, Payments, Executive Director

Work from home Full-time role

AVP - Advance Markets Compliance

Work from home Full-time role

Principal Onboarding Lead, Sales & Success

Work from home Full-time role

Field Based Patient Care Coordinator - Multiple Locations

Work from home Full-time role

Associate Customer Care Representative

Work from home Full-time role

Category Specialist (Corporate)

Work from home Full-time role

RN / Medical Reviewer / Remote

Work from home Full-time role

Data Analyst, Away from Home Sales Operations Team

Work from home Full-time role

Employee Relations Manager

Work from home Full-time role

Experienced Remote Data Entry Specialist – Aviation and Travel Industry

Work from home Full-time role

Transformative Salesforce Project Manager (Remote)

Work from home Full-time role

Part-time Chat Specialist

Work from home Full-time role

Experienced Full Stack Chat Moderator – Community Messaging Assistant | $25–$35/hr | Fully Remote, No Phone Calls, No Experience Needed

Work from home Full-time role

Technical Project Manager

Work from home Full-time role

Senior Product Manager – Internal Systems (Contract, Remote)

Work from home Full-time role