Henryk Michalewski

Senior Staff Research Scientist
Google DeepMind
henrykm@google.com
henrykmichalewski@gmail.com
Oxford, UK
+44 750 825 6803

Bio

Prior to 2015, my research focused on pure mathematics and theoretical computer science, covering logic, foundations, game theory, and optimization. I subsequently pivoted to work exclusively on machine learning. Throughout my career, I have been fortunate to interact with leadership that supported new research directions and collaborate with exceptional engineers and researchers who made it possible to pursue these directions.

Before joining Google in 2019, I worked on reinforcement learning for theorem proving, early deep RL scaling experiments with Intel, a sim2real project with Volkswagen, and model-based RL. At Google, I have contributed to PaLM, early program synthesis work with LLMs, Scratchpad, and Minerva. More recently, I worked on the math-specialized model presented in the Gemini 1.5 report, Big Sleep, AlphaProof, and all iterations of the main Gemini models. Leveraging Google’s infrastructure, I have conducted thousands of experiments and submitted over 1,000 pull requests—roughly half of which were to the Gemini codebase.

Work Engagements

Senior Staff Research Scientist

Google DeepMind, 2025–present

Staff Research Scientist

Google DeepMind, 2023–2025

Staff Research Scientist

Google Brain, 2021–2023

Leverhulme Fellow

Department of Computer Science, University of Oxford, 2021–2022

Visiting Researcher (Staff Faculty Visiting Researcher)

Google, 2019–2021

Visiting Researcher

Department of Computer Science, University of Oxford, 2018–2019

Invited Professor

École normale supérieure de Lyon, 2017

Data Scientist

deepsense.ai, 2016–2019

Associate Professor (last held position)

University of Warsaw, 2007–present, on a long term leave since 2016

Postdoctoral Researcher

Ben-Gurion University, Israel, 2004–2007

Open Source Contributions

Trax — contributions to sequence modeling, training pipelines, and reasoning-focused components.
Formal Putnam-like Benchmark — co-developer of an olympiad-level mathematical reasoning evaluation suite.
Eval-Hub — contributor to a unified evaluation framework for LLM reasoning, code generation, and multimodal tasks.

Olympiad-level formal mathematical reasoning with reinforcement learning

Nature 2025

PDF

Beyond Lines and Circles: Unveiling the Geometric Reasoning Gap in Large Language Models

EMNLP 2024

PDF

Promptbreeder: Self-Referential Self-Improvement via Prompt Evolution

ICML 2024

PDF

Focused Transformer: Contrastive Training for Context Scaling

NeurIPS 2023

PDF

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

ArXiv 2023

PDF

Solving Quantitative Reasoning Problems with Language Models

NeurIPS 2022

PDF

Multi-Game Decision Transformers

NeurIPS 2022

PDF

Hierarchical Transformers Are More Efficient Language Models

NAACL 2022 (Findings)

PDF

Program Synthesis with Large Language Models

ArXiv 2021

PDF

Show Your Work: Scratchpads for Intermediate Computation with Language Models

ArXiv 2021

PDF

Measuring and Improving BERT’s Mathematical Abilities by Predicting the Order of Reasoning

ACL 2021

PDF

Sim2Real Autonomous Driving

ICRA 2020

PDF

Model-Based RL for Atari

ICLR 2020

PDF

Reinforcement Learning of Theorem Proving

NeurIPS 2018

PDF

Distributed Deep Reinforcement Learning: Learn how to play Atari games in 21 minutes

ISC High Performance 2018

PDF

Logical Strength of Rabin’s Theorem

LICS 2015

PDF

MSO+U on Infinite Trees

ICALP 2014

PDF

Patents based on the Scratchpad paper contributions

Prompting Machine-Learned Models Using Chains of Thought — chain-of-thought prompting and consistency-based selection of model outputs for improved reasoning robustness.
Using Chains of Thought to Prompt Machine-Learned Models Pre-Trained on Diversified Objectives — construction of instructive query–answer–reasoning triples for steering large pre-trained models via chain-of-thought prompts.

Education

Habilitation in Computer Science

University of Warsaw, 2015

Thesis

Internship

Fields Institute, Toronto, Winter 2002

PhD in Mathematics

University of Warsaw, 1998–2002

Thesis

Internship

Vrije University, Amsterdam, 1998

MA in Mathematics

University of Warsaw, 1993–1998

Thesis

Mentoring of Students

Spyridon Mouselinos, 2021–2025, Ph.D. project
Towards Visual Reasoning,
co-supervised with Mateusz Malinowski (Google DeepMind).

Cécilia Pradic, 2014–2019, PhD thesis
Some proof-theoretical approaches to Monadic Second-Order logic,
co-supervised with Colin Riba (ENS Lyon).

Powered by Jekyll and Minimal Light theme.