Remote-Friendly (Travel-Required) | San Francisco, CA | Seattle, WA | New York City, NY Hybrid $350,000 - $500,000 USD

Anthropic is hiring a Research Engineer, Reward Models Platform

Responsibilities

  • Construct infrastructure to support rapid experimentation with reward signals, including tools for creating evaluation rubrics and analyzing human feedback data
  • Build automated systems to assess reward quality and detect anomalies such as reward hacking or unintended behaviors
  • Develop software that enables side-by-side comparison of different reward modeling approaches and their impact
  • Design end-to-end pipelines that streamline reward model development, from data collection to deployment
  • Implement observability tools to monitor reward signal integrity during training processes
  • Work closely with research teams to convert scientific objectives into scalable technical solutions
  • Improve existing platforms for better speed, stability, and usability
  • Help establish and document standardized practices for reward model development

Team

a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems

About company
Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.
All jobs at Anthropic Visit website
Job Details
Department Fine-Tuning
Category data
Posted 15 days ago