San Francisco, California, United States | Hybrid | USD 194,000 - 267,000 Yearly

Okta is hiring a Site Reliability Engineer

Okta is seeking a Site Reliability Engineer focused on Observability to own and expand our Observability ecosystem into Google Cloud. In this role, you will build a scalable Observability Platform using infrastructure as code and automation.

What You'll Do

  • Design, build, and maintain scalable observability infrastructure using tools like Terraform.
  • Optimize the collection, processing, and storage of observability data so that Splunk and Grafana services stay highly reliable and low-latency.
  • Participate in on-call rotations and lead post-incident reviews to drive systemic improvements.
  • Automate the deployment and scaling of observability agents and collectors to eliminate toil.

What We're Looking For

  • 5+ years of experience scaling and managing observability on Google Cloud Platform.
  • Expertise in creating intuitive, actionable Splunk or Grafana dashboards that correlate data across multiple sources.
  • 3+ years of experience in an SRE, DevOps, or Systems Engineering role with a focus on high-availability systems.
  • Strong coding skills in Python or Go for building internal tools and automating workflows.
  • Deep understanding of Linux internals, networking (TCP/IP, DNS, Load Balancing), and container orchestration (Kubernetes/GKE).
  • A data-driven approach to debugging complex, cross-service performance bottlenecks.
  • U.S. Person status (e.g., U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee).

Nice to Have

  • Hands-on experience with OpenTelemetry (OTel), Vector, or similar frameworks for instrumenting applications.
  • Experience migrating from Splunk to Grafana Loki.
  • Experience managing AWS-native observability tools.

Technical Stack

  • Terraform, Go, Python, Ruby
  • Google Cloud Platform, Splunk, Grafana
  • Kubernetes, GKE, Linux
  • OpenTelemetry, Vector, AWS

Benefits & Compensation

  • Compensation: $194,000 to $267,000 USD (San Francisco Bay Area) + equity where applicable.
  • Health, dental, and vision insurance.
  • 401(k), flexible spending account.
  • Paid time off (PTO), parental leave.

Work Mode

This is a hybrid role based in the San Francisco Bay Area.

Okta is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, marital status, age, physical or mental disability, or status as a protected veteran. We also consider for employment qualified applicants with arrest and conviction records, consistent with applicable laws.

Required Skills

  • Terraform, Go, Python, Ruby
  • Google Cloud Platform, Splunk, Grafana
  • Kubernetes, GKE, Linux
  • Networking, TCP/IP, DNS, Load Balancing
  • Observability

About the Company

Okta is a leading identity and access management platform that helps organizations secure their digital interactions and manage user authentication across various systems and applications.

Job Details

  • Department: Engineering
  • Category: Infrastructure
  • Posted: 2 months ago