Olivier Sigaud

Olivier Sigaud
Full Professor
Team: MLIA
Office: 56-66 301
Email: olivier.sigaud@sorbonne-universite.fr
Phone:01.44.27.88.53
Addresse: ISIR, Campus Pierre et Marie Curie, 4 place Jussieu, BC173, 75005 Paris
Bio: Born in 1968. Engineer, Phd in Computer Science from Paris XI University (Orsay) in 1996, PhD in Philosophy from Paris I University in 2004. Employed at Dassault Aviation (aerospace company, St-Cloud) from 1995 to 2001. Then Assistant Professor and finally Professor at LIP6 then ISIR. Main research topics: reinforcement learning, learning for robotics, computational neurosciences of decision making in animals.

Publication year

Type of document

Journal articles
Conference paper
Book sections
These
Others

Publications

Clémence Grislain, Hamed Rahimi, Olivier Sigaud, Mohamed Chetouani. I-FailSense: Towards General Robotic Failure Detection with Vision-Language Models. IEEE International Conference on Robotics & Automation (ICRA 2026), Jun 2026, Vienne, Austria. ⟨hal-05519469⟩
[ HTTP | PDF ]
Alexandre Chenu, Olivier Serris, Olivier Sigaud, Nicolas Perrin-Gilbert. Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration. 2025 IEEE-RAS 24th International Conference on Humanoid Robots (Humanoids), Sep 2025, Seoul, France. pp.405-412, ⟨10.1109/Humanoids65713.2025.11203179⟩. ⟨hal-05398834⟩
[ HTTP ]
Mohamed Salim Aissi, Clémence Grislain, Mohamed Chetouani, Olivier Sigaud, Laure Soulier, et al.. VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making. 2026. ⟨hal-05525519⟩
[ HTTP | PDF ]
Nicolas Castanet, Olivier Sigaud, Sylvain Lamprier. Imagine Beyond! Distributionally Robust Auto-Encoding for State Space Coverage in Online Reinforcement Learning. Neurips 2025, Dec 2025, San DIego, United States. ⟨hal-05118820⟩
[ HTTP | PDF ]
Mohamed Salim Aissi, Clément Romac, Thomas Carta, Sylvain Lamprier, Pierre-Yves Oudeyer, et al.. Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting. NAACL 2025 - Findings of the Association for Computational Linguistics, Apr 2025, Albuquerque, United States. pp.7030-7046, ⟨10.18653/v1/2025.findings-naacl.390⟩. ⟨hal-05321901⟩
[ HTTP | PDF ]
Loris Gaven, Thomas Carta, Clément Romac, Cédric Colas, Sylvain Lamprier, et al.. MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces. ICML 2025 - 42nd International Conference on Machine Learning, Jul 2025, Vancouver (BC), Canada. ⟨hal-05302437⟩
[ HTTP | PDF ]
Mohamed Salim Aissi, Clément Romac, Thomas Carta, Sylvain Lamprier, Pierre-Yves Oudeyer, et al.. Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting. 2024. ⟨hal-04844077⟩
[ HTTP ]
Loris Gaven, Clément Romac, Thomas Carta, Sylvain Lamprier, Olivier Sigaud, et al.. SAC-GLAM: Improving Online RL for LLM agents with Soft Actor-Critic and Hindsight Relabeling. IMOL 2024 - Intrinsically Motivated Open-ended Learning (Workshop at Neurips), Dec 2024, Vancouver, Canada. 2024. ⟨hal-04844089⟩
[ HTTP ]
Alexandre Chenu, Olivier Serris, Olivier Sigaud, Nicolas Perrin-Gilbert. Single-Reset Divide & Conquer Imitation Learning. 2024. ⟨hal-04822877⟩
[ HTTP ]
Vaynee Sungeelee, Antoine Loriette, Olivier Sigaud, Baptiste Caramiaux. Interactive curriculum learning increases and homogenizes motor smoothness. Scientific Reports, 2024, ⟨10.1038/s41598-024-53253-3⟩. ⟨hal-04529557⟩
[ HTTP | PDF ]
Nicolas Castanet, Olivier Sigaud, Sylvain Lamprier. Stein Variational Goal Generation for adaptive Exploration in Multi-Goal Reinforcement Learning. ICML, Jul 2023, Honolulu (Hawai), United States. ⟨hal-05092634⟩
[ HTTP | PDF ]
Vaynee Sungeelee, Antoine Loriette, Olivier Sigaud, Baptiste Caramiaux. Co-Apprentissage Humain-Machine: Cas d'Étude en Acquisition de Compétences Motrices. 34ème conférence Francophone sur l'Interaction Humain-Machine, Apr 2023, Troyes, France. ⟨hal-03992717⟩
[ HTTP | PDF ]
Vaynee Sungeelee, Antoine Loriette, Olivier Sigaud, Baptiste Caramiaux. Co-Apprentissage Humain-Machine : Cas d’Étude en Acquisition de Compétences Motrices. 2023. ⟨hal-04014981⟩
[ HTTP | PDF ]
Thomas Carta, Clément Romac, Thomas Wolf, Sylvain Lamprier, Olivier Sigaud, et al.. Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning. International Conference on Machine Learning 2023, Jul 2023, Honololu, Hawaii, United States. ⟨hal-03970122v3⟩
[ HTTP | PDF ]
Olivier Sigaud, Gianluca Baldassarre, Cédric Colas, Stephane Doncieux, Richard Duro, et al.. A Definition of Open-Ended Learning Problems for Goal-Conditioned Agents. 2024. ⟨hal-04810195⟩
[ HTTP | PDF ]
Alexandre Chenu, Nicolas Perrin-Gilbert, Olivier Sigaud. Divide & Conquer Imitation Learning. 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2022), Oct 2022, Kyoto, Japan. pp.8630-8637, ⟨10.1109/IROS47612.2022.9982020⟩. ⟨hal-03753530⟩
[ HTTP | PDF ]
Thomas Carta, Sylvain Lamprier, Pierre-Yves Oudeyer, Olivier Sigaud. EAGER: Asking and Answering Questions for Automatic Reward Shaping in Language-guided RL. NeurIPS 2022 - Thirty-sixth Conference on Neural Information Processing Systems, Nov 2022, Nouvelle-Orléans, United States. ⟨hal-03902423⟩
[ HTTP | PDF ]
Hugo Caselles-Dupré, Olivier Sigaud, Mohamed Chetouani. Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments. Thirty-sixth Conference on Neural Information Processing Systems (NeurIPS 2022), Nov 2022, New Orleans, United States. ⟨hal-03828002⟩
[ HTTP | PDF ]
Alexandre Chenu, Olivier Serris, Olivier Sigaud, Nicolas Perrin-Gilbert. Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration. 2022. ⟨hal-04823341⟩
[ HTTP ]
Aloïs Pourchot, Kevin Bailly, Alexis Ducarouge, Olivier Sigaud. Neural Architecture Search for Fracture Classification. 29th IEEE International Conference on Image Processing (ICIP 2022), Oct 2022, Bordeaux, France. ⟨hal-03753702⟩
[ HTTP | PDF ]
Ahmed Akakzia, Olivier Sigaud. LEARNING OBJECT-CENTERED AUTOTELIC BEHAVIORS WITH GRAPH NEURAL NETWORKS. Conference on Lifelong Learning Agents - CoLLAs 2022, Aug 2022, Montréal, Canada. ⟨hal-03753526⟩
[ HTTP | PDF ]
Aloïs Pourchot, Kévin Bailly, Alexis Ducarouge, Olivier Sigaud. An extensive appraisal of weight-sharing on the NAS-Bench-101 benchmark. Neurocomputing, 2022, 498, pp.28-42. ⟨10.1016/j.neucom.2022.04.108⟩. ⟨hal-03706214⟩
[ HTTP | PDF ]
Thomas Pierrot, Valentin Macé, Felix Chalumeau, Arthur Flajolet, Geoffrey Cideron, et al.. Diversity policy gradient for sample efficient quality-diversity optimization. GECCO '22: Genetic and Evolutionary Computation Conference, Jul 2022, Boston, United States. pp.1075-1083, ⟨10.1145/3512290.3528845⟩. ⟨hal-03864262⟩
[ HTTP ]
Cédric Colas, Tristan Karch, Olivier Sigaud, Pierre-Yves Oudeyer. Autotelic Agents with Intrinsically Motivated Goal-Conditioned Reinforcement Learning: A Short Survey. Journal of Artificial Intelligence Research, 2022, 74, ⟨10.1613/jair.1.13554⟩. ⟨hal-03901771⟩
[ HTTP | PDF ]
Thomas Pierrot, Valentin Macé, Felix Chalumeau, Arthur Flajolet, Geoffrey Cideron, et al.. DIVERSITY POLICY GRADIENT FOR SAMPLE EFFI-CIENT QUALITY-DIVERSITY OPTIMIZATION. Workshop on Agent Learning in Open-Endedness (ALOE) at ICLR 2022, 2022, virtual, Vatican City. ⟨hal-03753541⟩
[ HTTP | PDF ]
Olivier Sigaud, Hugo Caselles-Dupré, Cédric Colas, Ahmed Akakzia, Pierre-Yves Oudeyer, et al.. Towards Teachable Autonomous Agents. 2021. ⟨hal-03364200⟩
[ HTTP | PDF ]
Thomas Pierrot, Nicolas Perrin-Gilbert, Olivier Sigaud. First-Order and Second-Order Variants of the Gradient Descent in a Unified Framework. 30th International Conference on Artificial Neural Networks - ICANN 2021, Sep 2021, Bratislava, Slovakia. pp.197-208, ⟨10.1007/978-3-030-86340-1_16⟩. ⟨hal-03404369⟩
[ HTTP ]
Alexandre Chenu, Nicolas Perrin-Gilbert, Stéphane Doncieux, Olivier Sigaud. Selection-Expansion: A Unifying Framework for Motion-Planning and Diversity Search Algorithms. 30th International Conference on Artificial Neural Networks - ICANN 2021, Sep 2021, Bratislava, Slovakia. pp.568-579, ⟨10.1007/978-3-030-86380-7_46⟩. ⟨hal-03404366⟩
[ HTTP ]
Ahmed Akakzia, Cédric Colas, Pierre-Yves Oudeyer, Mohamed Chetouani, Olivier Sigaud. Grounding Language to Autonomously-Acquired Skills via Goal Generation. ICLR 2021 - Ninth International Conference on Learning Representation, May 2021, Vienna / Virtual, Austria. ⟨hal-03121146⟩
[ HTTP | PDF ]
Anis Najar, Olivier Sigaud, Mohamed Chetouani. Teaching a Robot with Unlabeled Instructions: The TICS Architecture. AAMAS 2021, May 2021, London (virtual), United Kingdom. ⟨hal-03224574⟩
[ HTTP ]
Alexandre Chenu, Nicolas Perrin-Gilbert, Stephane Doncieux, Olivier Sigaud. Selection-Expansion: A Unifying Framework for Motion-Planning and Diversity Search Algorithms. 2021. ⟨hal-03196479⟩
[ HTTP | PDF ]
Cédric Colas, Ahmed Akakzia, Pierre-Yves Oudeyer, Mohamed Chetouani, Olivier Sigaud. Language-Conditioned Goal Generation: a New Approach to Language Grounding in RL. 2021. ⟨hal-03099887⟩
[ HTTP | PDF ]
Cédric Colas, Tristan Karch, Olivier Sigaud, Pierre-Yves Oudeyer. Intrinsically Motivated Goal-Conditioned Reinforcement Learning: a Short Survey. 2021. ⟨hal-03099891⟩
[ HTTP | PDF ]
Pierre Fournier, Cédric Colas, Mohamed Chetouani, Olivier Sigaud. CLIC: Curriculum Learning and Imitation for object Control in non-rewarding environments. IEEE Transactions on Cognitive and Developmental Systems, 2021, 13 (2), pp.239-248. ⟨10.1109/TCDS.2019.2933371⟩. ⟨hal-02370859⟩
[ HTTP ]
Ryan Lober, Olivier Sigaud, Vincent Padois. Task Feasibility Maximization using Model-Free Policy Search and Model-Based Whole-Body Control. Frontiers in Robotics and AI, 2020, 7, ⟨10.3389/frobt.2020.00061⟩. ⟨hal-01620370v3⟩
[ HTTP | PDF ]
Stephane Doncieux, Nicolas Bredeche, Léni Kenneth Le Goff, Benoît Girard, Alexandre Coninx, et al.. DREAM Architecture: a Developmental Approach to Open-Ended Learning in Robotics. 2020. ⟨hal-02562103⟩
[ HTTP | PDF ]
Anis Najar, Olivier Sigaud, Mohamed Chetouani. Interactively shaping robot behaviour with unlabeled human instructions. Autonomous Agents and Multi-Agent Systems, 2020, 34 (2), ⟨10.1007/s10458-020-09459-6⟩. ⟨hal-02996137⟩
[ HTTP | PDF ]
Marwen Belkaid, Elise Bousseyrol, Romain Durand-de Cuttoli, Malou Dongelmans, Etienne K Duranté, et al.. Author Correction: Mice adaptively generate choice variability in a deterministic task. Communications Biology, 2020, 3 (1), pp.54. ⟨10.1038/s42003-020-0785-8⟩. ⟨hal-05393070⟩
[ HTTP | PDF ]
Guillaume Matheron, Nicolas Perrin, Olivier Sigaud. Understanding Failures of Deterministic Actor-Critic with Continuous Action Spaces and Sparse Rewards. Artificial Neural Networks and Machine Learning – ICANN 2020, Sep 2020, Bratislava, Slovakia. pp.308-320, ⟨10.1007/978-3-030-61616-8_25⟩. ⟨hal-03080925⟩
[ HTTP | PDF ]
Guillaume Matheron, Nicolas Perrin, Olivier Sigaud. PBCS: Efficient Exploration and Exploitation Using a Synergy Between Reinforcement Learning and Motion Planning. Artificial Neural Networks and Machine Learning – ICANN 2020, Sep 2020, Bratislava, Slovakia. pp.295-307, ⟨10.1007/978-3-030-61616-8_24⟩. ⟨hal-03080918⟩
[ HTTP | PDF ]
Marwen Belkaid, Elise Bousseyrol, Romain Durand-de Cuttoli, Malou Dongelmans, Etienne K Duranté, et al.. Mice adaptively generate choice variability in a deterministic task. Communications Biology, 2020, 3, pp.34. ⟨10.1038/s42003-020-0759-x⟩. ⟨hal-02485779⟩
[ HTTP | PDF ]
Felix Rutard, Olivier Sigaud, Mohamed Chetouani. Tirl: enriching actor-critic RL with non-expert human teachers and a trust model. The 29th IEEE International Conference on Robot and Human Interactive Communication, RO-MAN, 2020, Napoli, Italy. ⟨hal-03124262⟩
[ HTTP | PDF ]
Thomas Pierrot, Nicolas Perrin, Olivier Sigaud. First-order and second-order variants of the gradient descent in a unified framework. 2019. ⟨hal-02397757⟩
[ HTTP ]
Aloïs Pourchot, Nicolas Perrin, Olivier Sigaud. Importance mixing: Improving sample reuse in evolutionary policy search methods. 2019. ⟨hal-02397754⟩
[ HTTP ]
Marwen Belkaid, Jérémie Naudé, Philippe Faure, Olivier Sigaud. Modélisation des stratégies de génération de choix variables chez la souris. Conférence Nationale en Intelligence Artificielle, Jul 2019, Toulouse, France. ⟨hal-02328815⟩
[ HTTP | PDF ]
Cédric Colas, Pierre Fournier, Olivier Sigaud, Mohamed Chetouani, Pierre-Yves Oudeyer. CURIOUS: Intrinsically Motivated Modular Multi-Goal Reinforcement Learning. ICML 2019 - Thirty-sixth International Conference on Machine Learning, Jun 2019, Long Beach, United States. ⟨hal-01934921v2⟩
[ HTTP | PDF ]
Cédric Colas, Olivier Sigaud, Pierre-Yves Oudeyer. A Hitchhiker's Guide to Statistical Comparisons of Reinforcement Learning Algorithms. ICLR Worskhop on Reproducibility, May 2019, Nouvelle-Orléans, United States. ⟨hal-02369859⟩
[ HTTP | PDF ]
Olivier Sigaud, Freek Stulp. Policy search in continuous action domains: An overview. Neural Networks, 2019, 113, pp.28-40. ⟨10.1016/j.neunet.2019.01.011⟩. ⟨hal-02182466⟩
[ HTTP | PDF ]
Cédric Colas, Olivier Sigaud, Pierre-Yves Oudeyer. How Many Random Seeds? Statistical Power Analysis in Deep Reinforcement Learning Experiments. 2018. ⟨hal-01890154⟩
[ HTTP | PDF ]
Cédric Colas, Olivier Sigaud, Pierre-Yves Oudeyer. GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms. International Conference on Machine Learning (ICML), Jul 2018, Stockholm, Sweden. ⟨hal-01890151⟩
[ HTTP | PDF ]

Page