Role OverviewWe are looking for a Senior Software Engineer to lead the design and implementation of scalable cloud platform services. You will focus on distributed systems, container orchestration, and backend infrastructure that powers global operations across multiple datacenters.
Key Responsibilities
- Architect and maintain core components such as compute orchestration, API gateways, and multi-tenancy systems using Go, Java, Python, or Rust.
- Design and optimize RESTful and gRPC APIs to support cloud resource provisioning, lifecycle management, and monitoring.
- Build fault-tolerant, high-throughput distributed systems deployed across multiple regions.
- Implement service mesh configurations, rate limiting, and secure authentication and authorization frameworks.
- Write clean, tested, and documented code following CI/CD, TDD, and peer review practices.
- Enhance container orchestration efficiency and automate operational workflows to improve system reliability.
- Apply engineering leadership principles to guide project execution and improve team performance.
- Advance cloud-native capabilities through innovation in Kubernetes, Docker, and infrastructure automation.
- Develop Infrastructure-as-Code solutions using Terraflow or Pulumi for consistent environment provisioning.
- Design and manage data access layers using PostgreSQL, MySQL, MongoDB, DynamoDB, Redis, and Memcached, with optimized query and indexing strategies.
- Implement data migration, backup, and recovery processes tailored for multi-tenant environments.
- Establish observability with distributed tracing, structured logging, and metrics using Prometheus, Grafana, Jaeger, and the ELK stack.
- Create alerting systems and define SLI/SLO frameworks to ensure service reliability.
- Conduct performance analysis, load testing, and capacity planning to maintain scalability.
- Participate in on-call rotations and lead incident response with root cause analysis and post-mortems.
- Collaborate with AI, product, and infrastructure teams to deliver integrated technical solutions.
- Lead large-scale engineering initiatives, from planning to execution, in cloud computing domains.
- Contribute to architecture reviews, code quality standards, and technical mentorship for junior engineers.
- Evaluate emerging tools and technologies to continuously evolve platform capabilities.
Required Qualifications
- Bachelor’s or Master’s degree in Computer Science, Software Engineering, Electrical Engineering, or a related field.
- At least 5 years of professional software engineering experience, with 3+ years focused on cloud platforms, distributed systems, or scalable backend services.
- Strong command of backend languages such as Go, Java, Python, or Rust, with proven experience in production cloud environments.
- Deep expertise in Kubernetes, Docker, microservices, and service mesh technologies like Istio, Envoy, or Linkerd.
- Fundamental understanding of distributed systems: consensus, sharding, replication, eventual consistency, and fault tolerance.
- Hands-on experience with AWS, GCP, or Azure in large-scale infrastructure design and operations.
- Proficiency with databases including PostgreSQL, MySQL, MongoDB, and Redis, including modeling, optimization, and storage design.
- Experience with CI/CD pipelines (GitHub Actions, Jenkins, ArgoCD) and Infrastructure-as-Code tools (Terraform, Pulumi).
- Familiarity with observability stacks (Prometheus, Grafana, Jaeger, ELK) and SRE methodologies.
- Strong analytical, communication, and collaboration skills, with experience working in globally distributed teams.
Preferred Qualifications
- Knowledge of cloud security practices, including OAuth, JWT, RBAC, encryption, and compliance standards like SOC 2 or ISO 27001.
- Experience with AI/ML infrastructure, including GPU scheduling, model serving, and training pipelines.
- Fluency in English; additional proficiency in Chinese is beneficial.
- Background in multinational or cross-cultural work environments.
Work Environment
This role supports a globally distributed team, allowing flexible remote collaboration across regions including the United States, Bhutan, Norway, Canada, Malaysia, and Ethiopia. The company fosters an inclusive culture that values diverse perspectives, open communication, and personal growth. You’ll work in a dynamic, fast-growing environment with opportunities to shape systems from the ground up and contribute directly to the evolution of digital infrastructure.
Benefits
- Supportive, respectful workplace with a startup mindset
- Opportunities for autonomy, accountability, and rapid professional development
- Access to training, mentorship, and collaboration with technical pioneers
- Comprehensive welfare and growth-focused benefits


