The Post-Training team is responsible for training and improving pre-trained models to be deployed into ChatGPT, the API, and potential future products. The team partners closely with research and product teams across the company, and conducts research as a final step to prepare for real world deployment to millions of users, ensuring that our models are safe, efficient, and reliable.
About the Role
As a Research Engineer / Scientist, you will research and develop improvements to our models. Our team works in research areas combining reinforcement learning and products.
We're looking for individuals with strong ML engineering skills and research experience, especially with novel and highly capable models. An ideal candidate is passionate about product-driven research.
This role is based in San Francisco, CA. We use a hybrid work model of 3 days in the office per week and offer relocation assistance to new employees.
In this role, you will:
Own and pursue a research agenda to improve model capability and performance.
Collaborate closely with the other research and product teams, allowing customers to optimize their own models.
Build robust evaluations for tracking modeling improvements.
Design, implement, test, and debug code across our research stack.
Have a deep understanding of machine learning and machine learning applications.
Have a working knowledge of relevant models, and building evaluations for model capability improvement.
Are comfortable diving into a large ML codebase to debug.
Thrive in a dynamic and technically complex environment.