Research Scientist, Multimodal Generative AI, Google DeepMind
8.8.2025
Description
Design, rapidly implement, and rigorously evaluate cutting-edge deep learning algorithms and data curation for multimodal generative AI, with a particular emphasis on culturally-adapted image and video synthesis.
Report and present research findings and developments clearly and efficiently both internally and externally, verbally and in writing.
Suggest and engage in team collaborations to meet ambitious research goals, while also driving significant individual contributions.
Work in collaboration with our Ethics and Governance teams to ensure our advances in intelligence are developed ethically and provide broad benefits to humanity.
Qualifications
PhD in Computer Science, Artificial Intelligence, Machine Learning, Computer Vision, or equivalent practical experience.
2+ years of relevant experience in deep learning research and development, particularly in generative AI and related to image and video synthesis. This includes diffusion models and autoregressive generative models.
Experience in software development with one or more programming languages (e.g., Python) and deep learning frameworks (e.g., Jax, TensorFlow, PyTorch), with a track record of building high-quality research prototypes and systems.
Demonstrated experience in large-scale training of multimodal generative models.
A track record of research or engineering achievements, including publications in peer-reviewed conferences or journals.