Avatar

Agents & LLMs

Mistral AI

Biography

I work on agents at Mistral AI. Previously, I was a Member of Technical Staff at Cohere, where I worked on RL for LLMs and reasoning, and before that a research scientist at InstaDeep, where I focused on transformers for combinatorial optimization and discrete problems. I hold a Ph.D. in reinforcement learning for combinatorial optimization from Inria/CNRS, where I was part of the SequeL/ScooL team under the supervision of P. Preux.

Interests

  • Agents
  • Large Language Models
  • Reinforcement Learning
  • Combinatorial Optimization

Experience & Education

  • Research Scientist, 2025-present

    Mistral AI

  • Member of Technical Staff, 2024-2025

    Cohere

  • Research Scientist, 2023-2024

    Instadeep

  • PhD Student, 2019-2023

    Inria Lille, SequeL/ScooL team

Experience

 
 
 
 
 

Research Scientist

Mistral AI

Sep 2025 – Present Paris, France
Agents.
 
 
 
 
 

Member of Technical Staff

Cohere

Apr 2024 – Sep 2025 Paris, France
RL for LLMs and reasoning.
 
 
 
 
 

Research Scientist

InstaDeep

Apr 2023 – Apr 2024 London, UK
RL and transformers for combinatorial optimization.
 
 
 
 
 

Research Intern

InstaDeep

Apr 2022 – Oct 2022 London, UK
RL for combinatorial optimization, under the supervision of Thomas D. Barrett. Led to: Population-Based Reinforcement Learning for Combinatorial Optimization.
 
 
 
 
 

PhD Student

Inria

Oct 2019 – Apr 2023 Lille, France
Reinforcement learning for combinatorial optimization, graph representation. Under the supervision of P. Preux.
 
 
 
 
 

Graduate Research Intern

UC Berkeley

Apr 2018 – Aug 2018 California
Machine learning and statistics to study biological scRNA-seq data. Under the supervision of S. Dudoit.
 
 
 
 
 

Blockchain Developer (intern)

BitSpread Ltd

Jun 2017 – Nov 2017 London, UK
Developed Ethereum smart-contracts to create a decentralized investment fund.

Contact