Shape the reliability of a global API platform
We’re seeking a Senior Site Reliability Engineer to help evolve and maintain the performance and availability of our distributed systems. In this role, you’ll use real-time insights from large-scale data to drive automation, optimize infrastructure, and enhance system resilience across our platform.
What You’ll Do
- Design and implement automated solutions to improve system performance and reduce operational overhead
- Analyze system behavior at scale to proactively identify and resolve reliability challenges
- Collaborate with engineering teams to strengthen infrastructure and support seamless integration of services
- Champion best practices in monitoring, observability, and incident response
What We’re Looking For
- Proven experience as a Site Reliability Engineer with a track record of improving system robustness
- Ability to think critically and propose innovative solutions that challenge conventional approaches
- Strong communication skills and a collaborative mindset, with a focus on shared ownership
- Deep understanding of reliability engineering principles, including scalability, fault tolerance, and performance tuning
How We Work
This is a fully remote role with team members across the globe. We operate on a remote-first model that prioritizes flexibility, autonomy, and personal accountability. You’ll have the freedom to manage your time and workload while contributing to a culture built on trust and transparency.
Benefits include unlimited paid time off, full location independence, and a work environment that values initiative and ownership. We expect high standards and, in return, offer the space and support to meet them your way.


