About the Role
This position leads DevOps initiatives by designing and managing CI/CD systems, cloud infrastructure, and operational workflows to enhance system reliability and development velocity.
Responsibilities
- Lead the design and implementation of cloud infrastructure on major platforms
- Architect and maintain scalable CI/CD pipelines for automated deployments
- Drive adoption of infrastructure-as-code practices across engineering teams
- Oversee system reliability, monitoring, and incident response protocols
- Collaborate with development teams to optimize application performance
- Implement security controls within deployment workflows and cloud environments
- Manage containerization strategies using Docker and orchestration via Kubernetes
- Establish and enforce configuration management standards
- Lead capacity planning and cost optimization for cloud resources
- Mentor engineers in DevOps principles and tooling
- Evaluate and integrate new technologies to improve system efficiency
- Ensure compliance with organizational and regulatory standards
- Coordinate with security teams to maintain robust system defenses
- Troubleshoot complex production issues across distributed systems
- Promote observability through logging, monitoring, and alerting systems
- Support disaster recovery planning and execution
- Facilitate post-mortem analyses to improve operational resilience
- Drive automation initiatives to reduce manual operational tasks
- Participate in on-call rotations for critical systems
- Document system architecture and operational procedures
- Lead cross-team initiatives to standardize DevOps practices
- Optimize deployment frequency and rollback capabilities
- Ensure high availability and fault tolerance in production environments
- Work closely with product teams to align infrastructure with business goals
- Foster a culture of shared ownership and continuous delivery
Compensation
Competitive salary and benefits package
Work Arrangement
Hybrid work model with flexible scheduling
Team
Collaborative engineering environment focused on continuous improvement
Technology Stack
- Primary cloud platform: AWS
- Container orchestration: Kubernetes (EKS)
- Infrastructure provisioning: Terraform
- CI/CD: GitLab CI and GitHub Actions
- Monitoring: Prometheus, Grafana, and ELK stack
Team Mission
- Enable rapid, safe, and repeatable deployments
- Reduce operational toil through automation
- Ensure systems are resilient and observable
- Empower development teams with self-service tools
Available for qualified candidates


