Format des notes
Numérique sur 20
Programme détaillé
The course is roughly organised into four approaches to the theme of depth in Deep Reinforcement Learning:,
1\. Depth in value function (DQN and variants, distributional RL, ...),
2\. Depth in policy (PPO, SAC, imitation learning, ...),
3\. Depth in environment model (Monte Carlo Tree Search, model-based reinforcement learning),
4\. Depth in reward model (reward shaping, inverse reinforcement learning, transfer learning ...).Mots clés
apprentissage par renforcement, agents autonomes, prise de décision probabiliste, agents intelligents, prise de décision séquentielle