Google DeepMind · Mountain View

Research Engineer, Multimodal AI

1.10.2025

Description

You will be part of the REMY (Research & dEvelopment in Multimodal technologY) team in the Media Understanding organization at Google DeepMind. In this role, you will push forward state-of-the-art research in multimodal AI representation models, in the context of recent advances in multimodal foundation models more broadly. You'll be at the forefront of developing models that power Google products used by billions of people worldwide. Your work will directly impact how these products understand and interact with images, text, and video. This is a unique opportunity to shape the future of multimodal AI and its applications in a dynamic and impactful environment.

We are a team of research/software engineers, research scientists, and machine learning experts, working together to enable superhuman understanding of the multimodal world.

You'll be developing the next SOTA models for multimodal understanding. Your work will include researching new modeling techniques, implementing research ideas, running experiments to evaluate improvements, and identifying new opportunities.

Key responsibilities:

As a member of the Media Understanding team, you will be responsible for conducting fundamental and applied research in multimodal AI (computer vision, language understanding, machine learning, and related areas). Your job responsibilities will include:

Conducting core research in computer vision, language understanding, multimodal models, large-scale AI models, and other key computer vision tasks.
Training and evaluating AI models for a variety of use cases.
Researching, implementing, and adapting state-of-the-art deep learning approaches for Google’s use cases.
Collaborating closely with other GDM and partner teams to make progress towards building the most advanced representation models.

Qualifications

Ph.D. in Computer Science or related quantitative field, or B.S./M.S. in Computer Science or related quantitative field with 2+ years of relevant experience.
Conduct research to identify and address impactful problems inspired by current and future real-world needs. Investigate and develop novel solutions by studying related work, conducting experiments, and constructing prototypes and demonstrations.
Innovate and assess new machine learning models and techniques for pilot projects, quickly demonstrating viability and potential impact. Transform successful prototypes into scalable solutions for wider integration within Google's products.
Collaborate with product teams to drive the implementation of research insights, fostering innovation and the development of new products.

Strong research experience and publication record in top-tier conferences.
Experience with core software engineering and applied implementations of AI.
A good team player who has demonstrated that they can work across teams.

Benefits

The US base salary range for this full-time position is $141,000 to $202,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

Application

View listing at origin and apply!
