Exam: text in English; documents allowed; Internet not allowed.
Past exams: see the Google Drive for the subjects in English !
https://drive.google.com/drive/u/1/folders/1uOfxeRUkGlGMuRlQchjpqMsuTUi4HxQN
October 2nd, 2019
  1. Decision Trees
    1. Slides: DecisionTree_2019.pdf
  2. Validation
    1. Slides: main_Validation_2019.pdf
October 9th
  1. Support Vector Machines
    1. Slides: Slides_SVM_2019.pdf


Articles for seminar courses

You can now pick your time slot: here
Select a paper in pairs or solo (this you are supposed to have done already)
Prepare a written summary (2 pages pdf) - you will send it to us before
Prepare an oral (~10-15 slides is advised, 15 min presentation + 5 min questions = 20 min total time).
Papers in each bloc are related (they cite each other, improve on each other): discuss with the other pairs in order not to repeat same
things and make the presentations most interesting for the audience (you can share your summaries to know what each other is doing).
When the link is not there, google the title.
Names in red are the names of students who selected a paper (therefore, it's no longer "available"). To choose a paper, send us an email.
Papers added, Oct. 2019
First papers

EM and Bayes

  1. Naive Bayes models for probability estimation, ICML 2005 LOPEZ ESTIGARRIBIA, Pablo Jose

Decision trees

  1. Mining High-Speed Data Streams MLDM 2000 Robin de Groot, Jérémy Hutin
  2. Stress-Testing Hoeffding Trees PKDD 2005 Sara Días, Ricardo Rojas
  3. Extremely Fast Decision Tree Mining for Evolving Data Streams, KDD 2017 René Gómez , Ainhoa Zapirain

  1. Very Simple Classification Rules Perform Well on Most Commonly Used Datasets, MLJ 93,

  1. Box Drawings for Learning with Imbalanced Data, KDD 2014 Pratham Solanki,Edoardo Conte
  2. Learning Certifiably Optimal Rule Lists for Categorical Data, JMLR 2018

  1. Random Forests, MLJ 2001, LUO Hongyi LI Ziheng
  2. Feature-Budgeted Random Forest, ICML 2015, Amine Kili

Applications of decision trees.

  1. Decision tree and random forest models for outcome prediction in antibody incompatible kidney transplantation, Biomedical signal processing and control, 2019 Gabriela Martinez Lopera,Jose Badillo Puebla
  2. Using Cutting-Edge Tree-Based Stochastic Models to Predict Credit Risk, 2018 (google the title); Braulio Blanco Lambruschini, Tzu-Man Wu
  3. Machine Learning based Approach to Financial Fraud Detection..., 2017; Buse Ozer, Eugen Patrascu

Estimation for inference and ensembles of classifiers

  1. Experimental comparison between bagging and Monte Carlo ensemble classification, ICML 2005 Gaspard Donada Vidal


  1. Classifier Chains for Multi-label Classification, ECML PKDD 2009 Timothée Babinet
  2. An Analysis of Chaining in Multi-Label Classification, ECAI 2012 (google the title) Nicolas Devatine & Xinneng Xu

Area under the ROC curve

  1. Learning Mixtures of Localized Rules by Maximizing the Area Under the ROC Curve, ECAI 2004 Sebastien Warichet,Theo Deschamps-Berger
  2. Learning Interestingness Measures in Terminology Extraction: A ROC-based approach, ICDM 2004 Ramine HAMIDI, Florian Bertelli
  3. Model selection via the AUC, ICML 2004
  4. Optimizing abstaining classifiers using ROC analysis, ICML 2005


Finding good hyper-parameters

  1. Random search for hyper-parameter optimization, JMLR 2012, Gaetano D'Agostino and Francesco Cian
  2. Sequential model-based optimization for general algorithm configuration, LION 2011
  3. Collaborative hyperparameter tuning, ICML 13 Clément Veyssière, Eric Wang
  4. FREEZE-THAW BAYESIAN OPTIMIZATION, ArXiv 1014 Eva Agrawal, Wafa Bouzouita
  5. Speeding Up Automatic Hyperparameter Optimization of Deep Neural Networks by Extrapolation of Learning Curves, IJCAI 2015

Support Vector Machines

  1. Using String Kernels to Identify Famous Performers from Their Playing Style, ECML PKDD 2004 Woody HOJEILY
  2. Supervised clustering with support vector machines, ICML 2005, Moez Ezzeddine et Ghassen Chaabane
  3. Adapting two-class support vector classification methods to many class problems, ICML 2005
  4. A support vector method for multivariate performance measures, ICML 2005
  5. An efficient method for simplifying support vector machines, ICML 2005