About the Role
The role involves leading the development and operation of foundational systems infrastructure, improving system reliability, and supporting scalable architectures across production environments.
Responsibilities
- Design and implement scalable backend systems and infrastructure components
- Own the reliability, performance, and uptime of core services
- Diagnose and resolve complex system-level issues in production environments
- Collaborate with engineering teams to optimize service architecture
- Develop automation tools to improve deployment and operational efficiency
- Maintain system security, monitoring, and incident response protocols
- Lead root cause analysis for critical outages and performance bottlenecks
- Ensure infrastructure complies with operational and security standards
- Drive improvements in observability and telemetry systems
- Mentor engineers in best practices for systems design and operations
- Evaluate and integrate new technologies into the infrastructure stack
- Support disaster recovery and business continuity planning
- Optimize resource utilization and cost-efficiency of cloud infrastructure
- Contribute to capacity planning and scalability forecasting
- Enforce configuration management and infrastructure-as-code practices
- Participate in on-call rotations for critical system support
- Improve CI/CD pipelines for faster and safer deployments
- Collaborate on cross-team initiatives requiring system-level expertise
- Document system architecture and operational procedures
- Promote a culture of operational excellence and proactive problem solving
Nice to Have
- Master’s degree in a technical discipline
- Experience with real-time or low-latency systems
- Contributions to open-source infrastructure projects
- Knowledge of service mesh and API gateway technologies
- Experience in regulated or compliance-heavy environments
- Background in performance benchmarking and profiling
- Familiarity with hardware provisioning and data center operations
- Experience with zero-downtime deployment strategies
- Understanding of cryptographic protocols and key management
- Prior work on global-scale distributed platforms
Compensation
Competitive salary with equity and benefits package
Work Arrangement
Hybrid work model with flexibility for remote and in-office collaboration
Team
Collaborative engineering team focused on infrastructure resilience and system innovation
Technology Stack
- Primary languages include Go and Rust; infrastructure runs on Kubernetes with cloud-agnostic design principles
- Monitoring stack built on Prometheus, Grafana, and custom telemetry tools
Culture & Values
- Emphasis on transparency, technical rigor, and sustainable engineering practices
- Commitment to inclusive collaboration and continuous learning
Available for qualified candidates

