About the Role
This role involves building and managing cloud-native systems with a focus on automation, reliability, and efficient deployment pipelines across distributed environments.
Responsibilities
- Design and manage Kubernetes clusters for production workloads
- Develop infrastructure-as-code templates using tools like Terraform or CloudFormation
- Automate deployment pipelines using CI/CD platforms such as Jenkins or GitLab CI
- Monitor system performance and troubleshoot issues across distributed environments
- Ensure platform security and compliance with organizational standards
- Collaborate with development teams to streamline release processes
- Optimize containerized applications for scalability and efficiency
- Maintain high availability and fault tolerance in cloud environments
- Implement logging and observability solutions across services
- Support incident response and root cause analysis for production outages
- Operate and maintain configuration management tools such as Ansible, Chef, or Puppet
- Integrate security best practices into deployment workflows
- Scale infrastructure to meet growing service demands
- Document system architecture and operational procedures
- Evaluate and adopt new DevOps tools and technologies
- Work closely with SRE and platform teams to improve system resilience
- Participate in on-call rotations for critical system support
- Ensure consistent environments across development, staging, and production
- Drive automation initiatives to reduce manual intervention
- Contribute to disaster recovery planning and execution
Nice to Have
- Experience with large-scale Kubernetes deployments
- Certifications in cloud or DevOps technologies
- Familiarity with service mesh tools like Istio or Linkerd
- Background in telecommunications or media industries
- Experience with multi-region or hybrid cloud setups
- Knowledge of compliance frameworks such as SOC or ISO
- Contributions to open-source DevOps projects
- Advanced scripting or programming experience
- Experience with database operations in cloud environments
- Understanding of regulatory requirements for data handling
Compensation
Competitive salary and benefits package
Work Arrangement
Hybrid work model with partial remote flexibility
Team
Part of a large-scale engineering organization focused on cloud infrastructure and platform reliability
Why This Role Matters
- This position plays a critical role in maintaining the reliability and scalability of services used by millions of customers daily.
- Engineers in this role directly influence platform stability, deployment speed, and operational efficiency.
Growth Opportunities
- Engineers can lead automation initiatives, mentor junior engineers, and contribute to strategic infrastructure planning.
- Team members are encouraged to pursue certifications and attend technical conferences.
Sponsorship available for qualified candidates


