About the Role
Design and implement systems that integrate reinforcement learning techniques with language models to improve performance and enable new capabilities in real-world applications.
Responsibilities
- Develop and refine integration frameworks between machine learning models and production environments
- Implement and optimize reinforcement learning pipelines for model alignment and improvement
- Collaborate with research and engineering teams to deploy scalable AI solutions
- Contribute to the design of experiments that evaluate model behavior and performance
- Work on infrastructure to support training, evaluation, and deployment of frontier models
- Identify and solve technical challenges in model integration and feedback loops
- Support the development of tools that enable developers to build AI-powered applications
- Iterate quickly on prototypes and production systems based on empirical results
- Maintain high standards for code quality, testing, and documentation
- Engage in cross-functional efforts to improve product capabilities and customer value
Nice to Have
- Prior work on language model alignment or fine-tuning
- Experience with model deployment and API integration
- Knowledge of semantic search, retrieval-augmented generation, or agent systems
- Contributions to open-source machine learning projects
- Research publications in relevant AI conferences
Compensation
Competitive salary and equity package
Work Arrangement
Remote-friendly within UTC−06:00 to UTC+01:00
Team
Collaborative team of top-tier researchers, engineers, and designers advancing the state of AI
Who are we?
Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI. We obsess over what we build. Each one of us is responsible for helping to increase the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what’s best for our customers. This organization is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products. Join us on our mission and shape the future!
Note
- We have offices in London, Paris, Toronto, San Francisco, and New York, but we are also remote-friendly! Applicants for this role may work anywhere between UTC−06:00 and UTC+01:00.
- This post is co-authored by both company humans and company technology.
If some of the above doesn’t line up perfectly with your experience, we still encourage you to apply! If you want to work really hard on a glorious mission with teammates who want the same thing, this is the place for you.
Diversity and Inclusion
We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.


