Google DeepMind · Tokyo

Research Engineer - Multimodal Companion Agent

9/23/2025

Qualifications

  • Bachelors/Masters/Ph.D. in Computer Science, Artificial Intelligence, or a related field.
  • Experience with relevant ML frameworks such as JAX, TensorFlow, or PyTorch.
  • Strong programming skills in Python and experience with large-scale data pipelines.
  • Solid understanding of LLM internals, e.g., typical training pipelines, computational characteristics of training/inference, mechanisms for multimodal extension.
  • Knowledge of Deep Reinforcement Learning (RL), LLM Reasoning, Imitation Learning, Memory-Based Architectures, Vision-Language-Model (VLM), and/or Vision-Language-Action (VLA) models.
  • Proven track record of designing, implementing, and maintaining robust technical assets (such as libraries, frameworks, or models) used by a large number of technical stakeholders; experience with OSS contributions is a plus.
  • Excellent communication and collaboration skills.
  • A minimum of 5 years of relevant professional experience.
  • Experience building agents for 3D virtual environments, simulators, or video games.
  • Strong track record in competitions in machine learning, data science, or AI in games.
  • Strong track record in AI competitions or publications in top-tier conferences (NeurIPS, ICLR, ICML, CVPR, etc.).

Application

View listing at origin and apply!