Description
We are looking for a Machine Learning Engineer to help us train Claude specifically for virtual collaborator workflows. While Claude excels at general tasks, much knowledge work requires targeted training on real organizational data and workflows. You will design and implement reinforcement learning environments that make Claude the best virtual collaborator, training it on everything from navigating internal knowledge to creating financial models.
Responsibilities:
- Designing and implementing reinforcement learning pipelines specifically targeted at virtual collaborator use cases (productivity, organizational navigation, vertical domains)
- Building and scaling our data creation platform for generating high-quality, open-ended tasks with domain experts and crowdworkers
- Integrating real organizational data to create authentic training environments
- Developing robust rubric-based evaluation systems that maintain quality while avoiding reward hacking
- Training Claude on advanced document manipulation, including understanding, enhancing, and co-creating documents
- Partnering directly with product teams to ensure training aligns with shipped features