I work on agents at Mistral AI. Previously, I was a Member of Technical Staff at Cohere, where I worked on RL for LLMs and reasoning, and before that a research scientist at InstaDeep, where I focused on transformers for combinatorial optimization and discrete problems. I hold a Ph.D. in reinforcement learning for combinatorial optimization from Inria/CNRS, where I was part of the SequeL/ScooL team under the supervision of P. Preux.
Research Scientist, 2025-present
Mistral AI
Member of Technical Staff, 2024-2025
Cohere
Research Scientist, 2023-2024
Instadeep
PhD Student, 2019-2023
Inria Lille, SequeL/ScooL team