Remote (Global)

Tether Operations Limited is hiring a Senior AI Inference Engineer (100% Remote)

About the Role

What You'll Do

Own the development and optimization of the inference infrastructure for on-device AI, ensuring models perform efficiently and consistently across diverse hardware. You'll focus on runtime quality, tuning system behavior for fast startup, low memory pressure, and balanced throughput and latency during extended use.

Work directly with machine learning models using frameworks like llama.cpp, ggml, and ONNX, deploying them to edge environments with a strong emphasis on performance and reliability. Partner with research teams to bridge the gap between experimental models and production-ready implementations, helping refine models for real-world deployment.

Integrate advanced AI capabilities into existing software products, ensuring seamless performance and user privacy by design.

Requirements

  • Strong proficiency in C++ with a focus on systems-level programming and runtime efficiency
  • Hands-on experience deploying machine learning models to edge or resource-constrained devices
  • Familiarity with inference frameworks such as llama.cpp, ggml, and ONNX
  • Excellent written and verbal communication skills in English
  • Ability to collaborate across disciplines, especially with research and product teams

Benefits

  • Work 100% remotely from anywhere in the world
  • Collaborate with a lean, high-impact team at the forefront of fintech innovation
  • Contribute to a transparent, globally distributed organization committed to technological empowerment
  • Be part of a mission-driven effort advancing blockchain-based financial systems

Required Skills

C++, llama.cpp, ggml, ONNX, AI inference, machine learning optimization, model quantization, performance optimization

About company
Tether Operations Limited
Tether Operations Limited describes itself as pioneering a global financial revolution, with solutions that let businesses integrate reserve-backed tokens across blockchains. Its product suite includes the USDT stablecoin, energy solutions for Bitcoin mining, data solutions for AI and P2P tech, digital education, and ventures at the intersection of technology and human potential.
Job Details
Department: Data
Category: Data
Posted: 2 months ago