Anthropic · San Francisco/New York City/Seattle · Hybrid
Research Engineer / Research Scientist, Tokens
10/11/2025
Description
- Optimizing the throughput of a new attention mechanism
- Comparing the compute efficiency of two Transformer variants
- Making a Wikipedia dataset in a format models can easily consume
- Scaling a distributed training job to thousands of GPUs
- Writing a design doc for fault tolerance strategies
- Creating an interactive visualization of attention between tokens in a language model
Qualifications
- Have significant software engineering experience
- Are results-oriented, with a bias towards flexibility and impact
- Pick up slack, even if it goes outside your job description
- Enjoy pair programming (we love to pair!)
- Want to learn more about machine learning research
- Care about the societal impacts of your work
Benefits
$340,000 - $425,000 USD
Application
View listing at origin and apply!