Islamabad, Islamabad Capital Territory, Pakistan On-site

Skylabs AI is hiring a Senior Backend Engineer | Skylabs AI | Jobs By Workable

Skylabs AI is seeking a Senior Backend Engineer to lead the architecture and implementation of our entire cloud-native infrastructure for AI agents and developer tools. You will build a scalable, resilient, and secure microservices platform that powers the next generation of intelligent software.

What You'll Do

  • Design, build, deploy, and maintain the platform as a set of resilient, scalable microservices (e.g., auth, payments, agent orchestration).
  • Build the critical systems that manage the lifecycle of long-running agentic tasks, including state management and asynchronous communication.
  • Architect and implement containerized execution environments (Docker, Kubernetes) where agents can safely build, test, and run code.
  • Deploy, manage, and wrap high-throughput LLM inference servers (NVIDIA Triton, vLLM) to serve models to our agent systems.
  • Design, secure, and manage core APIs (REST and gRPC), including public-facing MCP/ACP and internal service-to-service communication.
  • Build the high-throughput data ingestion pipeline to process and store massive volumes of telemetry and training data from IDE clients.
  • Implement and manage authentication (OAuth2/OIDC) and payment/subscription (Stripe) systems.
  • Own Infrastructure as Code (Terraform) and build out comprehensive observability (Prometheus, Grafana, Jaeger) across the entire stack.

What We're Looking For

  • Proven ability to design, build, deploy, and maintain a complex system as a set of resilient, scalable, and independent microservices.
  • Mastery of designing clean, secure, and high-performance APIs (both REST and gRPC). Experience with API gateways, versioning, and documentation.
  • Deep practical experience with both SQL (e.g., PostgreSQL, MySQL) and NoSQL (e.g., MongoDB, DynamoDB, Redis) databases, including data modeling, query optimization, and scaling.
  • Experience building and scaling backend systems specifically for LLM use cases, understanding the unique demands of stateful, long-running agentic tasks.
  • Experience building and managing full-stack observability (e.g., using Prometheus, Grafana, ELK/OpenSearch, and distributed tracing like Jaeger).
  • Expertise in implementing robust identity systems, including sign-up, OAuth2/OIDC, JWTs, and fine-grained Role-Based Access Control (RBAC).
  • Experience integrating and managing payment and subscription systems (e.g., Stripe), including metering and subscription logic.
  • Experience designing systems that manage the lifecycle of complex, long-running 'agent' tasks.
  • Deep experience with Docker and Kubernetes (K8s) for orchestrating 'remote sandbox' environments.
  • Expertise in using systems like Kafka, RabbitMQ, or gRPC streams to manage communication between microservices, the IDE plugin, and the AI agents.
  • Hands-on experience deploying and managing high-throughput LLM inference servers like NVIDIA Triton, TGI, or vLLM on GPU-enabled infrastructure.
  • Experience building the backend service that securely wraps inference endpoints, handles request batching, and serves them to the agent orchestrator.
  • Experience designing and building a high-throughput data pipeline to receive, process, and store telemetry and training data.
  • Mastery of Terraform or CloudFormation to provision and manage the entire cloud infrastructure.
  • Experience implementing rate limiting, request validation, and service-to-service authentication (e.g., mTLS).
  • A strong DevOps mindset and experience building automated CI/CD pipelines (e.g., GitLab CI, GitHub Actions) for a microservices environment.

Nice to Have

  • Familiarity with lightweight virtualization (e.g., Firecracker) for remote sandboxing.

Technical Stack

  • Docker, Kubernetes, Kafka, RabbitMQ, gRPC, NVIDIA Triton, vLLM
  • PostgreSQL, MySQL, MongoDB, DynamoDB, Redis
  • OAuth2/OIDC, JWT, Stripe
  • Terraform, CloudFormation, Prometheus, Grafana, Jaeger, ELK/OpenSearch
  • GitLab CI, GitHub Actions, Firecracker

Team & Environment

Work alongside world-class AI researchers and engineers.

Benefits & Compensation

  • Competitive salary in USD
  • Comprehensive health allowance
  • Relocation allowance (if moving to Islamabad)
  • Monthly team events and offsites
  • A beautiful, collaborative office space

Work Mode

This is an onsite position located in Islamabad.

Skylabs AI is an equal opportunity employer.

Required Skills
DockerKubernetesKafkaRabbitMQgRPCNVIDIA TritonvLLMPostgreSQLMySQLMongoDBPythonFastAPISystem DesignMachine LearningCI/CD DockerKubernetesKafkaRabbitMQgRPCNVIDIA TritonvLLMPostgreSQLMySQLMongoDBPythonFastAPISystem DesignMachine LearningCI/CD
Relocating to Thailand?

Visa and work permit handled by experts

SVBL manages your entire visa process — from application to approval. Work permits, extensions, and compliance all covered. One partner for legal, immigration, and settling in.

Work permit processing
Visa extensions & renewals
Immigration compliance
Banking & housing guidance
Get free consultation
Free initial consultation
About company
Skylabs AI
A US-headquartered company conducting applied research on AI for intelligent reasoning. Specializes in complex neurosymbolic AI to solve intricate problems within software engineering.
All jobs at Skylabs AI Visit website
Job Details
Category backend
Posted 4 months ago