Presentations
21 February 2018, Shannon amphitheatre, building 660 |
23 February 2018, room 2014, 2nd floor, Shannon building, 660 |
https://docs.google.com/spreadsheets/d/1eidQleMOdpmXbr3tGaKUdhM0avHUGwBQTtW-lh2QrL4/edit?usp=sharing
Last year's exam
- 2016-2017 AIC_RL_Exam_16.pdf
Projects
To do: code, experiments, analysis, written report.
Copy-pasting existing programs from the Net will have consequences, which could include receiving a mark of 0.
Projects involve at most 3 students, except for the last two (Halite & Alesia: 4 students).
Projects are due on February 15th, 23:59 GMT+1.
Each group must produce:
- A report of about 2 pages (max 3 pages excluding references), as TeX and .pdf files, including a description of the approach, the results, and a comparison with other algorithms / the state of the art (when possible), using the ICML 2017 format. People unable to write TeX can produce a .doc(x) document along with its .pdf. ( Description | ICML2017 TeX package )
- The code of your implemented approach. The code should work "out of the box"; add a notice/README listing the required packages/libraries, with special notes if needed. Submitting code taken from the internet with little or no modification could lead to unwanted consequences.
You can discuss your project's problems/ideas and ask for more information at: diviyan (at) lri (dot) fr
The subjects are the following (in increasing difficulty):
- Mountain car problem (compare two approaches)
- Inverted pendulum (compare two representations of the problem)
- The acrobot
- Octopus
- TD-Gammon
- Bicycle: keeping balance + moving forward
- Anti-Imitation Policy learning: reproduce an experiment from mainDIVA.pdf
- halite.io
- Alesia game (see Approximate Dynamic Programming for Two-Player Zero-Sum Markov Games, ICML 15) Alesia_game.zip
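As a rough illustration of the first subject, here is a minimal tabular Q-learning sketch on the classic Mountain Car dynamics (as described in Sutton & Barto). The `MountainCar` class, the discretization grid, and all hyperparameters are our own simplified assumptions, not part of the course material; an actual project would more likely build on an existing simulator (e.g. OpenAI Gym) and would need to compare at least two approaches.

```python
import math
import random

class MountainCar:
    """Self-contained Mountain Car dynamics (Sutton & Barto, Sec. 10.1)."""

    def reset(self):
        # Start near the bottom of the valley with zero velocity.
        self.pos = random.uniform(-0.6, -0.4)
        self.vel = 0.0
        return (self.pos, self.vel)

    def step(self, action):
        # action in {0, 1, 2}: push left / no push / push right
        self.vel += 0.001 * (action - 1) - 0.0025 * math.cos(3 * self.pos)
        self.vel = max(-0.07, min(0.07, self.vel))
        self.pos = max(-1.2, min(0.6, self.pos + self.vel))
        if self.pos == -1.2:          # inelastic left wall
            self.vel = 0.0
        done = self.pos >= 0.5        # goal: reach the right hilltop
        return (self.pos, self.vel), -1.0, done  # -1 reward per step

def discretize(state, bins=20):
    """Map the continuous (position, velocity) state to a grid cell."""
    pos, vel = state
    i = min(bins - 1, int((pos + 1.2) / 1.8 * bins))
    j = min(bins - 1, int((vel + 0.07) / 0.14 * bins))
    return i, j

def q_learning(episodes=200, alpha=0.1, gamma=0.99, eps=0.1, bins=20):
    """Tabular epsilon-greedy Q-learning on the discretized state space."""
    Q = [[[0.0] * 3 for _ in range(bins)] for _ in range(bins)]
    env = MountainCar()
    for _ in range(episodes):
        s = discretize(env.reset(), bins)
        for _ in range(2000):  # cap on episode length
            if random.random() < eps:
                a = random.randrange(3)
            else:
                a = max(range(3), key=lambda x: Q[s[0]][s[1]][x])
            nxt, r, done = env.step(a)
            s2 = discretize(nxt, bins)
            target = r + (0.0 if done else gamma * max(Q[s2[0]][s2[1]]))
            Q[s[0]][s[1]][a] += alpha * (target - Q[s[0]][s[1]][a])
            s = s2
            if done:
                break
    return Q
```

Such a tabular baseline could then be compared against a second approach (e.g. SARSA, or Q-learning with tile-coding function approximation), as the subject requires.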
Pointers
- Video: Richard Sutton, 2016, https://www.microsoft.com/en-us/research/video/tutorial-introduction-to-reinforcement-learning-with-function-approximation/
- Some videos from the Boston Dynamics group
Evaluation
- Written exam, February 12
- Graded lab sessions (TPs)
- Projects
13 Nov., Michele Sebag
- Generalities RL_2017_Cours1.pdf
- Value functions RL_2017_Cours2.pdf
20 Nov., MS + DK
- Value functions (continued); model-free settings RL_2017_Cours3.pdf
27 Nov., DK
4 Dec.: no class
11 Dec., MS
8 Jan., MS + DK
- Multi-armed bandits: revised course RL_2017_Cours4_revised.pdf
15 Jan.
- Lecture: Function Approximation Cours_RL_15_Jan_2018.pdf
24 Jan.
- Lecture by Mehdi Khamassi
- Lecture: Direct Policy Search RL_2017_Cours5.pdf
Date to be fixed: paper presentations
- Neural Optimizer Search with Reinforcement Learning ICML 2017
- Boosted Fitted Q-Iteration
- Constrained Policy Optimization
- Curiosity-driven Exploration by Self-supervised Prediction, Pauline Brunet and Quentin Bouchut
- The K-armed Dueling Bandits Problem, Zizhao Li and Xudong Zhang
- DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
- Coordinated Multi-Agent Imitation Learning, Ghiles SIDI SAID and Amine BIAD
- Local Bayesian Optimization of Motor Skills
- Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning, Thomas Gauthier and Pereira Abou Rejaili Rodrigo
- Designing Neural Network Architectures using Reinforcement Learning, Mohamed Ali Dargouth & Walid Belrhalmia
- Robot gains Social Intelligence through Multimodal Deep Reinforcement Learning, Eden Belouadah
- Abstraction Selection in Model-based Reinforcement Learning
- Universal Value Function Approximators
- Deterministic Policy Gradient Algorithms
- Dynamic Programming Boosting for Discriminative Macro-Action Discovery
Deep RL: DQN, AlphaGo Zero, AlphaZero
- Playing Atari with Deep Reinforcement Learning, Vincent Boyer and Ludovic Kun
- [|], Zhengying Liu
- Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm, Thomas Foltete and Guillaume Collin
- Deep Reinforcement Learning from Self-Play in Imperfect-Information Games, Adrien Pavao and Eleonor Bartenlian