Luma AI is hiring a Research Scientist / Engineer – Multimodal Capabilities

About the Role

Luma AI is looking for a Research Scientist / Engineer – Multimodal Capabilities to unlock advanced behaviors in our foundation models. You'll join the Multimodal Capabilities team to conduct strategic research on combining vision, audio, and language to solve fundamental questions.

What You'll Do

  • Collaborate with the Foundation Models team to identify capability gaps and research solutions.
  • Design datasets, experiments, and methodologies to systematically improve model capabilities across vision, audio, and language.
  • Develop evaluation frameworks and benchmarking approaches for multimodal AI capabilities.
  • Create prototypes and demonstrations that showcase new multimodal capabilities.

What We're Looking For

  • Strong programming skills in Python and PyTorch.
  • Experience with multimodal data processing pipelines and large-scale dataset curation.
  • Understanding of computer vision, audio processing, and/or natural language processing techniques.

Nice to Have

  • Expertise working with interleaved multimodal data.
  • Hands-on experience with Vision Language Models, Audio Language Models, or generative video models.

Technical Stack

  • Python, PyTorch

Team & Environment

You will be part of the Multimodal Capabilities team and collaborate closely with the Foundation Models team.

Benefits & Compensation

  • Salary: $200,000 - $300,000/yr + competitive equity in the form of stock options.
  • A comprehensive benefits plan.

Luma AI is an equal opportunity employer.

Required Skills
PythonPyTorchMachine LearningMultimodal AIComputer VisionNatural Language ProcessingDeep LearningResearchModel TrainingLarge-scale Systems PythonPyTorchMachine LearningMultimodal AIComputer VisionNatural Language ProcessingDeep LearningResearchModel TrainingLarge-scale Systems
Scaling your freelance income?

Invoice multiple clients effortlessly

Managing 3+ international clients? Glopay streamlines everything. One EU company, unlimited invoices, automatic compliance. You just send and get paid.

Unlimited clients & invoices
Multi-currency support
Automated tax compliance
Client portal for easy payments
Scale with Glopay
Trusted by 10,000+ freelancers
About company
Luma AI
Luma AI is a technology company focused on developing advanced multimodal AI foundation models, working on innovative approaches to combining vision, audio, and language data.
All jobs at Luma AI Visit website
Job Details
Category data
Posted 8 months ago