Hybrid

Bjak is hiring a MLOps 工程师 (MLOps Engineer)

BJAK is looking for an MLOps Engineer focused on building and scaling impactful AI solutions. In this role, you will run and optimize state-of-the-art open-source models, ensuring they are safe, trustworthy, and performant at scale. You will collaborate closely with cross-functional teams across product, engineering, operations, infrastructure, and data.

What You'll Do

  • Run and manage open-source models efficiently, optimizing for cost and reliability.
  • Ensure high performance and stability across GPU, CPU, and memory resources.
  • Monitor and troubleshoot model inference to maintain low latency and high throughput.
  • Collaborate with engineers to implement scalable and reliable model serving solutions.

What We're Looking For

  • Experience with model serving platforms such as vLLM or HuggingFace TGI.
  • Proficiency in GPU orchestration using tools like Kubernetes, Ray, Modal, RunPod, or LambdaLabs.
  • Ability to monitor latency, costs, and scale systems efficiently with traffic demands.
  • Experience setting up inference endpoints for backend engineers.

Technical Stack

  • vLLM, HuggingFace TGI
  • Kubernetes, Ray, Modal, RunPod, LambdaLabs

Team & Environment

You will work in a flat structure, collaborating closely with regional teams across product, engineering, operations, infrastructure, and data.

Benefits & Compensation

  • Health, dental & vision insurance.
  • Global travel insurance (for you & your dependents).
  • Unlimited, flexible time off.
  • Housing rental subsidies.
  • Quality company cafeteria.
  • Overtime meals.

Work Mode

This is a hybrid role. It is a global position, with the company headquarters located in Malaysia.

BJAK values speed, clarity, and relentless ownership. Our high-density, high-performance team is focused on high-quality work and global impact.

Required Skills
vLLMHuggingFace TGIKubernetesRayModalRunPodLambdaLabsMLOpsMachine LearningInfrastructureCloudDistributed SystemsModel ServingDevOps vLLMHuggingFace TGIKubernetesRayModalRunPodLambdaLabsMLOpsMachine LearningInfrastructureCloudDistributed SystemsModel ServingDevOps
Need to work legally in Thailand?

Work permits without the paperwork nightmare

Thai immigration rules are strict and easy to get wrong. SVBL handles the bureaucracy — correct visa type, proper documentation, timely submissions. You focus on your work.

Right visa type for your situation
Document preparation & submission
Deadline tracking & renewals
Direct liaison with immigration
Talk to an expert
10+ years experience
About company
Bjak
Bjak is focused on providing access to affordable and sustainable financial services for people in ASEAN. Headquartered in Malaysia, Bjak is the largest insurance portal in Southeast Asia. Their main portal, Bjak.com, helps millions find the insurance policy with the best value and highest coverage. They invest in technology such as Custom API, trading systems, and data science to enable easy access to financial services.
All jobs at Bjak Visit website
Job Details
Category infrastructure
Posted 5 months ago