About

Google DeepMind

Google DeepMind · Mountain View

Research Engineer, Model Security Against Adversarial Misuse

8/29/2025

Description

Develop and deploy advanced AI/ML solutions to identify and mitigate potential cybersecurity misuse, leveraging prompt engineering, machine learning, and post-training.
Implement advanced post-training algorithms to optimize Gemini for cybersecurity misuse prevention.
Diagnose and interpret training outcomes (regressions, gains), and propose and implement solutions to improve model capabilities.
Actively monitor and evolve the overall defense system's mitigation capability through system metric design and improvement.
Develop reliable automated evaluation pipeline for cybersecurity misuse mitigation metrics that are strongly correlated with human expert judgment of threat severity.
Construct adversarial evaluation benchmarks to probe the limits and failure modes of cybersecurity misuse mitigation performance.

Qualifications

BSc, MSc or PhD/DPhil degree in computer science, applied stats, machine learning or similar experience working in industry
Experiences in fine-tuning and adaptation of LLMs (e.g. advanced prompting, supervised fine-tuning, RLHF)
Cybersecurity knowledge and background is a great advantage.
Deep understanding of machine learning and statistics
Strong knowledge of systems design and data structures
Proven experience with TensorFlow, JAX, PyTorch, or similar leading deep learning frameworks
Recent experience conducting applied research to improve the quality and training/serving efficiency of large transformer-based models
A passion for Artificial Intelligence and Cybersecurity.
Excellent communication skills and proven interpersonal skills, with a track record of effective collaboration with cross-functional teams

Benefits

The US base salary range for this full-time position is between $166,000 - $291,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

Application

View listing at origin and apply!

Content

About

About

Related jobs

Research Engineer, Model Security Against Adversarial Misuse

Description

Qualifications

Benefits

Application

Related jobs