Google DeepMind · New York City

Research Scientist, Audio

3/13/2026

Description

Research Scientists at Google DeepMind lead our efforts in developing novel algorithmic architecture towards the end goal of solving and building Artificial General Intelligence.

In this role, responsibilities will include making key contributions into the latest research developed in the Gemini audio pillar, such as:

Key responsibilities:

  • Data: Unlocking new audio to X capabilities within the model, both in pre-training and post-training.
  • Models: Improving quality of models for understanding and generation. This includes research to improve our tokenizers, better techniques for generation quality, and looking at joint audio and visual representations. 
  • Evals: Better evaluation methods (human, auto raters, automated metrics) to measure quality of open-ended tasks.

Qualifications

  • PhD in Computer Science, Computer Vision, Speech Processing, or Machine Learning related field.
  • Experience working with LLMs.
  • Audio or video understanding and/or generation experience.
  • A proven track record of research and publications in some of the following areas: audio generation, video generation, LLMs
  • A real passion for AI!

Benefits

The US base salary range for this full-time position is between $147,000 - $211,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

Application

View listing at origin and apply!

Fast-track your ML job hunt :

Be the first to hear about new sota jobs + exclusive salary research + career cheatsheets.