Driving research on post-training techniques (e.g., RL, SFT, and preference optimization) specifically tailored for personalization scenarios.
Developing novel evaluation frameworks and simulation methods to measure model quality against user behaviors / feedback.
Designing and training agents capable of orchestrating tools and APIs to deliver hyper-personalized experiences.
Qualifications
PhD in Machine Learning, Computer Science, or a relevant field (or equivalent practical research experience).
A proven track record of research excellence (e.g., publications at top-tier venues like NeurIPS, ICML, ICLR, or significant industry contributions), ranging from recent graduates to experienced researchers.
Strong software engineering skills to complement your research background.
Hands-on experience with modern post-training methods (SFT, RLHF, etc.).
Prior work applying LLMs to personalization, memory, or agentic workflows.\
Benefits
The US base salary range for this full-time position is between $141,000 USD - 244,000 USD + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.