United States Remote (Global)

DFIN | Financial Risk, Compliance and Software Solutions is hiring a Site Reliability Engineer

DFIN is looking for a Principal Site Reliability Engineer - Cloud to ensure our SaaS product infrastructure is fast, cost-effective, stable, and optimized for our customers. As a guardian of non-functional requirements, you will be responsible for designing, building, securing, monitoring, and maintaining our cloud platform. SREs at DFIN take ownership of availability, performance, change management, monitoring, and incident response.

What You'll Do

  • Champion and implement a culture to maintain performant, reliable, secure, cost-effective platform cloud infrastructure in DFIN SaaS products based on operationalized processes you define.
  • Champion security of our cloud infrastructure collaborating with Security and Governance teams and using static and dynamic tooling.
  • Champion and implement application and cloud infrastructure monitoring and alerting to prevent client impacting issues by ensuring system availability, performance and scalability to maintain SLOs and SLAs.
  • Optimize cloud infrastructure and application performance at scale while maintaining effective cost controls.
  • Automate cloud infrastructure buildout and maintenance including system operational runbooks.
  • Dive deep into technology and stay on the forefront of the latest tools, technologies, and strategies; help evaluate, prototype, and integrate them into operationalized work processes.
  • Perform with broad independence and deliver on project milestones and tasks you define on schedule while communicating progress regularly.
  • Build strong relationships with SRE team members and software engineering teams to hold each other accountable for quality expectations.
  • Learn continuously and apply lessons learned.
  • Evangelize best practices, eliminate bottlenecks, and improve process.
  • Participate in on-call duties 365/24/7 and lead the triage and RCA of production incidents.

What We're Looking For

  • 8+ years experience designing, building, securing, monitoring and maintaining cloud infrastructure in Azure or AWS.
  • 5+ years experience creating, configuring, maintaining and monitoring Kubernetes clusters (AKS or EKS) in cloud infrastructure to optimize application performance and reliability.
  • 5+ years building and deploying Infrastructure as Code with Terraform or similar technology.
  • 5+ years experience with common cloud networking, firewall and load balancing configuration.
  • 5+ years experience writing software in any modern software language such as C# .NET, Java.
  • 5+ years experience creating automated deployments with tools such as Harness, Azure DevOps, Ansible or Jenkins to manage Infrastructure as Code and software build and deployment in a continuous integration (CI) / continuous delivery (CD) environment.
  • 5+ years experience implementing production performance, availability, and scalability monitoring and alerting using a tool such as New Relic, Dynatrace, DataDog or AppDynamics.
  • 5+ years experience supporting public client facing revenue generating systems.
  • Experience monitoring and preventing issues with databases and database queries (SQL) using tools like Solarwinds Database Performance Analyzer, Idera SQL Diagnostic Manager, or Redgate SQL Monitor.
  • Experience planning, coordinating, developing and executing all stages of post deployment verification test scripts.
  • Experience securing Windows or Linux systems in 24x7 production environment.
  • BS in Computer Science or equivalent work experience.

Technical Stack

  • Cloud Providers: Azure, AWS
  • Orchestration: Kubernetes (AKS, EKS)
  • Infrastructure as Code: Terraform
  • Languages: C# .NET, Java
  • CI/CD & Automation: Harness, Azure DevOps, Ansible, Jenkins
  • Monitoring & Observability: New Relic, Dynatrace, DataDog, AppDynamics
  • Database Monitoring: Solarwinds Database Performance Analyzer, Idera SQL Diagnostic Manager, Redgate SQL Monitor, SQL

Benefits & Compensation

  • Competitive compensation
  • Flexible workplace
  • Comprehensive benefits
  • Opportunities for professional growth

Work Mode

This is a global position.

It is the policy of Donnelley Financial Solutions to select, place, and manage all its employees without discrimination based on race, color, national origin, gender, age, religion, actual or perceived disability, veteran status, actual or perceived sexual orientation, genetic information or any other protected status.

Required Skills
AzureAWSKubernetesAKSEKSTerraformC# .NETJavaHarnessAzure DevOpsAnsibleJenkinsCloud InfrastructureMonitoringNetworking AzureAWSKubernetesAKSEKSTerraformC# .NETJavaHarnessAzure DevOpsAnsibleJenkinsCloud InfrastructureMonitoringNetworking
Need to work legally in Thailand?

Work permits without the paperwork nightmare

Thai immigration rules are strict and easy to get wrong. SVBL handles the bureaucracy — correct visa type, proper documentation, timely submissions. You focus on your work.

Right visa type for your situation
Document preparation & submission
Deadline tracking & renewals
Direct liaison with immigration
Talk to an expert
10+ years experience
About company
DFIN | Financial Risk, Compliance and Software Solutions
Delivers innovative software and service solutions for essential financial reporting and capital markets transactions.
All jobs at DFIN | Financial Risk, Compliance and Software Solutions Visit website
Job Details
Department Information Technology
Category infrastructure
Posted 2 months ago