PhD in Machine Learning, Reinforcement Learning, Natural Language Processing, or a related field.
Strong data analysis and synthetic data generation skills.
Strong development skills in Python and experience with deep learning frameworks like JAX, PyTorch, or TensorFlow.
Experience building and working with large-scale ML training systems.
Deep theoretical and practical experience in Reinforcement Learning (e.g., policy gradient methods, value-based methods, model-based RL, credit assignment).
Experience developing and training large generative models (e.g., LLMs).
Strong track record of academic publications in top-tier conferences (e.g., NeurIPS, ICML, ICLR, AAAI).
Familiarity with research on game theory, multi-agent systems, or learning from human or AI feedback (RLHF/RLAIF).
Experience building or using user simulators for RL training.
Benefits
The US base salary range for this full-time position is $166,000 to $291,000, plus bonus, equity, and benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.