Dearborn, Michigan, United States On-site

Ford is hiring a Site Reliability Engineer

Ford Motor Company is seeking a Site Reliability Engineer to join our team. In this role, you will be instrumental in elevating the performance and dependability of our Marketing and Sales Tech platform and applications. Your work will directly impact the smooth operation and evolutionary growth of our technology landscape.

What You'll Do

  • Participate in a 24/7 on-call rotation, providing rapid response to critical incidents.
  • Diagnose, troubleshoot, and resolve complex production issues to reduce Mean Time to Recovery.
  • Execute and contribute to the continuous improvement of operational runbooks and Standard Operating Procedures.
  • Lead and participate in blameless post-mortems and Root Cause Analysis sessions.
  • Partner with cross-functional teams to architect long-term reliability solutions.
  • Define and track Service Level Indicators and Objectives to measure service health.
  • Collaborate with Product Owners to establish service levels and manage Error Budgets.
  • Provide critical analysis during monthly release reviews on service health impact.
  • Leverage and optimize Ford’s observability suite to monitor system health and proactively identify anomalies.
  • Identify observability blind spots and implement solutions for comprehensive system visibility.
  • Manage metric collection, dashboard creation, and alert definitions using Terraform.
  • Design robust notification strategies and thresholds for KPI/SLO violations.
  • Champion automation by developing scripts, tools, and streamlined workflows to eliminate manual tasks.
  • Design and implement self-healing mechanisms to automatically remediate common failures.
  • Implement and manage AI-driven observability solutions for proactive monitoring.
  • Coordinate with platform and engineering teams to resolve production bottlenecks.
  • Deliver clear, data-driven status reports on system health and SRE initiatives to leadership.

What We're Looking For

  • Bachelor’s degree in computer science or a related field.
  • At least 5 years of professional experience in Site Reliability Engineering or DevOps.
  • Deep hands-on experience with Google Cloud Platform (specifically Cloud Run and GKE) and with OpenShift.
  • Advanced proficiency in Terraform, including writing reusable modules and automating infrastructure.
  • Experience building comprehensive system observability from the primary telemetry types: metrics, events, logs, and traces.
  • Hands-on experience with Dynatrace or similar APM tools for distributed tracing and profiling.
  • Proficiency in at least one high-level programming language (Java, Node.js, Python, or Go).
  • Proven experience managing high-severity incidents and the full incident lifecycle.

Technical Stack

  • Google Cloud Platform (GCP), Cloud Run, GKE, OpenShift
  • Terraform
  • Dynatrace
  • Java, Node.js, Python, Go

Team & Environment

You will collaborate with diverse teams across the organization, including cross-functional development and platform teams.

Work Mode

This position is on-site.

Ford is an equal opportunity employer.

Required Skills
Google Cloud Platform, Cloud Run, GKE, OpenShift, Terraform, Dynatrace, Java, Node.js, Python, Go, Site Reliability Engineering, DevOps, Infrastructure as Code, System Observability
About company
Ford
Ford Motor Company is an established global automotive manufacturer building a better world through innovative, exciting, and sustainable products and services. The company advances technologies in autonomy, electrification, and smart mobility.
Job Details
Department: Engineering
Category: Infrastructure
Posted: 2 months ago