Related jobs

OpenAI›ML Engineer›

ML Engineer
San Francisco

›

Fast-track your ML job hunt :

Be the first to hear about new sota jobs
+ exclusive salary research + career cheatsheets.

OpenAI · San Francisco · Hybrid

RE/RS, Data Understanding - Foundations

5/29/2026

Description

We're looking to advance how OpenAI builds and understands pretraining data at scale. You'll treat data quality and curation as core research problems: developing new methods to select, combine, and transform data; creating datasets that improve model capabilities; and designing rigorous experiments to understand how data choices and interventions affect model learning and downstream behavior. You'll work closely with frontier models and web-scale data to build evidence for which approaches work and why, then translate successful research into scalable data processing pipelines

Qualifications

Have a strong track record of new or improved ML ideas, through publications, projects, or applied research.
Own and drive a research agenda, from choosing the right problems to carrying long-running work through to impact.
Be excited by OpenAI’s empirical, collaborative approach to research.

Nice To Have

Thoughtfulness about AI’s impact, including privacy, provenance, and data quality.
Experience building high-performance deep learning or large-scale data processing systems.

Application

View listing at origin and apply!

Related jobs

OpenAI›ML Engineer›

ML Engineer
San Francisco

›

Fast-track your ML job hunt :

Be the first to hear about new sota jobs + exclusive salary research + career cheatsheets.

About

Related jobs

RE/RS, Data Understanding - Foundations

Description

Qualifications

Application

Related jobs