San Francisco, California, United States

Cognizant is hiring a Site Reliability Engineer

Responsibilities

  • Develop software solutions to automate manual operational tasks across the lifecycle.
  • Diagnose and resolve critical incidents, lead post-incident reviews without blame, and implement fixes to prevent recurrence.
  • Work closely with development teams from design through deployment to ensure systems are built for reliability and scale.
  • Analyze application behavior and metrics to define meaningful service level objectives.
  • Create patterns that enable systems to self-recover and withstand failures.
  • Build automated solutions for software updates, configuration changes, and product releases.
  • Partner with senior engineers and provide guidance to less experienced team members.
  • Architect, launch, and oversee cloud environments on AWS with an emphasis on automation, growth capacity, and protection.
  • Implement and manage infrastructure using code with tools like Terraform.
  • Continuously monitor system performance, uptime, and security, applying observability principles to enhance system health.
Required Skills
AWSTerraformPythonBashCI/CDJenkinsDockerKubernetesInfrastructure as CodeAutomationCloud Infrastructure
About company
Cognizant
Cognizant is a global technology and professional services leader, empowering clients to reimagine processes, innovate and transform in a rapidly evolving digital landscape. In the UK, we work with major public sector bodies to deliver secure, scalable, citizen-centred digital services that make a real impact.
All jobs at Cognizant Visit website
Job Details
Department Information Technology
Category infrastructure
Posted 4 months ago