Anthropic · San Francisco · Hybrid

Research Engineer, Performance RL

3/24/2026

Description

  • Developing systems that enable models to use computers effectively

  • Advancing code generation through reinforcement learning

  • Pioneering fundamental RL research for large language models

  • Building scalable RL infrastructure and training methodologies

  • Enhancing model reasoning capabilities

Qualifications

  • Have expertise with accelerators (CUDA, ROCm, Triton, Pallas), ML framework programming (JAX or PyTorch).

  • Have worked across the stack – kernels, model code, distributed systems.

  • Know how to balance research exploration with engineering implementation.

  • Are passionate about AI's potential and committed to developing safe and beneficial systems.

  • Experience with reinforcement learning.

  • Experience porting ML workloads between different types of accelerators.

  • Familiarity with LLM training methodologies.

Benefits

$350,000 - $850,000 USD

Application

View listing at origin and apply!

Fast-track your ML job hunt :

Be the first to hear about new sota jobs + exclusive salary research + career cheatsheets.