We are seeking Staff and Principal AI Engineers to lead the development of advanced speech modeling systems. You will design, train, and deploy machine learning models that power real-time conversational AI, with a strong focus on speech-to-text (STT) and text-to-speech (TTS) technologies. The role demands solving complex problems in data acquisition, training efficiency, reinforcement learning alignment, and low-latency inference.
What You'll Do
- Research, develop, and deploy production-grade machine learning systems for real-time AI interactions
- Push the boundaries of performance in speech processing to achieve state-of-the-art results
- Optimize training pipelines and inference infrastructure for speed and scalability
- Collaborate on challenges spanning data collection, model alignment, and deployment on edge or embedded systems
What We're Looking For
- PhD in a technical field or BS/BA with substantial experience in ML and software engineering
- Minimum of 5 years of hands-on work in ML engineering or applied research, with proficiency in Python or C++
- Proven background in speech, NLP, video processing, or action planning
- Solid grasp of neural networks, data structures, and algorithms
- Experience with PyTorch and other ML frameworks
- Fluency in English
Nice to Have
- Active engagement with recent advancements in voice AI and machine learning
- Experience in pre-training, fine-tuning, RLHF, and evaluating large models
- Background in physics or applied mathematics
- Ability to adapt in a fast-moving, collaborative environment
Work Environment
This role is based in Switzerland with flexibility for remote work within the country. Optional support for future relocation to the United States may be available, subject to eligibility and immigration criteria.
