“Ingredients for Motion Planning-powered Reinforcement Learning”, HDR defence of Nicolas Perrin-Gilbert

Category: Soutenance

Nicolas Perrin-Gilbert, CNRS research fellow, will be defending his habilitation to supervise research (HDR) on Thursday 17 October at 4.30pm at ISIR, on the Pierre and Marie Curie campus of Sorbonne University.

Title of work: “Ingredients for Motion Planning-powered Reinforcement Learning”.

The members of the jury are as follows:

– Aleksandra Faust, Google Deepmind (Reporter),

– Matthieu Geist, University of Lorraine & Cohere (Examiner),

– Nicolas Mansard, LAAS-CNRS ( Reporter),

– Jochen J. Steil, Technical University of Brunswick ( Reporter),

– Nicolas Thome, Sorbonne University (Examiner).

Summary:

My HDR focuses on four main contributions, quite distinct, but all having a link with a common objective, that of improving exploration in reinforcement learning through the use of movement planning techniques based on random sampling.

The first contribution concerns geometric transformations between continuous movements and discrete sequences of contacts, which enables us to approach the problems of generating locomotion movements from a new angle. The second contribution proposes a technique for generalising or imitating trajectories based on the application of successive diffeomorphic transformations. The third deals with the management of successive sequences of goals, in particular the notions of intermediate goals and ways of considering them in order to achieve an overall goal more efficiently. Finally, the fourth contribution presents an ‘off-policy’ reinforcement learning algorithm, aimed at improving the training of control policies when a significant part of the data comes from exploration trajectories.

Contact: Nicolas Perrin-Gilbert, CNRS Research Fellow

Published on 14/10/2024.