User Tools

Site Tools


Sidebar

Practical Information:

Teaching:

Bâtiment Nautibus
43, Bd du 11 Novembre 1918
69622 Villeurbanne Cedex.
☏: +33(0)472 43 16 35
email: marc.plantevit-at-univ-lyon1.fr

Research:

Bureau 501.319
Bâtiment Blaise Pascal
7, Avenue Jean Capelle
69621 Villeurbanne Cedex
☏: +33(0)472 43 84 87
Fax: +33(0)472 43 87 13
email: marc.plantevit-at-liris.cnrs.fr

m1ens2016_project

M1ENS -- DBDM -- DM Project

The goal of this project is to apply the concepts and the technologies previously seen. To this end, you have to choose a public (or personal) data set that has to be validated.

On the considered dataset, you have to either bring some insights according to an already given task (e.g., classification task) or define your self the general aims and discover some knowledge from the data (produce added value from the data). To this end, you can use any data mining/machine learning method as well as any algorithm or software (Knime, Sci-Kit Learn (Python), Web Api (Google, Bing, Yahoo, …)).

Datasets

Datasets Possible Mining Task
Datasets available on http://www.kaggle.com/ the related aims or other ones, I have to valid your choice
Other datasets you want I have to valid your choice

<note important>The dataset and the main goals must be validated on April 4th, or by email. </note>

Tentative schedule

GIDSubjectTime
G1P. Simonaitis & D. LajouFootball data challenge 13h30
G2M. Boritchev & M. ChardetSan Francisco Crimes 13h45
G3J-Y. Franceschi, F. Lebeau & V. MollimardLoL14h
G4B. Brikci-Sid, S. Tendjaoui & H. YampaDetecting gender from micro-reviews14h15
G5V. Michielini, E. Moutot & E. OshurkoDeath cause prediction14h30
G6R. Grünblatt, S. Mauras & X. VuDistrict mapping from social media14h45
G7C. Lucas 15h

Expectations

You have – using the different concepts seen during the lectures (but not uniquely) – produce added value from data (answer the a specific question, discover knowledge, …). You can use any tools/techno/algorithms.

You have to:

  • Write a report (pdf format) describing your work;
  • Give me an archive of your code;
  • Present your work on April 25th: a 10-minute presentation followed by questions (5 minutes)

<note important> The report, presentation and source code must be sent by email (marc.plantevit-at-liris.cnrs.fr) before 04/26/2016 (23h59) 1). </note>

<note important> You can work in group of maximum 3 persons.

  • Expected work = f(|group|) with f strictly increasing ;-).

</note>

1)
If the archive is too big, provide a link to download it.
m1ens2016_project.txt · Last modified: 2016/04/20 08:41 by mplantev

CNRS INSA de Lyon Université Lyon 1 Université Lyon 2 École centrale de Lyon