Research on post-training (e.g., RL and SFT) for information-seeking scenarios in Gemini
Research on novel evaluation methods for improving model quality, grounding and factuality
Research on tool-call orchestration and improved retrieval methods for information-seeking scenarios
Qualifications
PhD in a relevant area, or an equivalent research/publication record
Years of experience: any level, from recent PhD graduates onwards
Strong software-engineering skills in addition to a research background
Experience in reinforcement learning
Experience in post-training methods
Experience with LLMs for information-seeking scenarios
Benefits
The US base salary range for this full-time position is between $141,000 and $202,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.