United States Hybrid

Cognizant is hiring a Site Reliability Engineer

Responsibilities

  • Lead the management and tuning of Splunk systems to maintain consistent performance and uptime.
  • Guide the adoption of Site Reliability Engineering principles to improve system stability and growth capacity.
  • Offer technical leadership in setting up and managing Grafana dashboards for live system monitoring and data display.
  • Work with diverse teams to deploy ELK stack tools for efficient log handling and insight generation.
  • Use Dynatrace AppMon to track application health and resolve performance concerns before escalation.
  • Create and execute plans to boost infrastructure effectiveness and minimize service interruptions.
  • Perform ongoing evaluations of system performance to detect opportunities for enhancement.
  • Promote technical advancement by exploring and adopting emerging tools and methods.
  • Ensure infrastructure operations follow recognized industry standards and recommended procedures.
  • Organize training and knowledge transfer activities to strengthen team expertise.
  • Support initiatives in the media sector by applying technical skills to refine system architectures.
  • Advance organizational goals by improving system dependability, leading to better media service delivery.
  • Keep updated records of system setups and operational workflows for continuity and reference.

Work Arrangement

Hybrid

Required Skills
GrafanaELK StackAWS CloudWatchAWS LambdaMonitoring
About company
Cognizant
Cognizant is a global technology and professional services leader, empowering clients to reimagine processes, innovate and transform in a rapidly evolving digital landscape. In the UK, we work with major public sector bodies to deliver secure, scalable, citizen-centred digital services that make a real impact.
All jobs at Cognizant Visit website
Job Details
Department Information Technology
Category infrastructure
Posted 4 months ago