Description
- Develop models, tools, metrics, and datasets for assessing and evaluating the safety of generative models over the model deployment lifecycle
- Develop models, models, and tools to interpret and explain failures in language and diffusion models
- Build and maintain human annotation and red teaming pipelines to assess quality and risk of various Apple products
- Prototype, implement, and evaluate new ML models and algorithms for red teaming LLMs