Description
In this role you will have the opportunity to develop data generation methodologies for personalized model and search systems evaluations, online and offline metrics for quality assessment and build scalable systems for comprehensive evaluation. Our team is responsible for models and evaluation of search experiences that answer user’s questions using their personal documents with privacy at the forefront.
Role responsibilities include:
- Design and implement novel data generation and data quality assessment methods to support modeling and evaluation of personal Q&A.
- Build scalable, automated systems for large scale, end-to-end evaluation of models and search powered systems.
- Design and implement offline and online metrics to assess product and component level quality for personal Q&A.
- Collaborate with partner teams to define data and evaluation requirements and priorities, and to explore opportunities for enhancements to the Personal Q&A stack.
- Develop long-term technical vision for Personal Q&A quality; identify problem areas and drive solutions as part of a larger roadmap.