Curriculum

Fabien Duchateau
Enseignant-chercheur (Maître de Conférences)

Bureau :Nautibus 12.057

Téléphone :+33 (0)4 72 44 58 25

Fax :+33 (0)4 72 43 15 36

Courriel :rf.1noyl-vinu [ta] uaetahcud.neibaf

Page web :http://liris.cnrs.fr/~fduchate/

Adresse :Bâtiment Nautibus
Campus de la Doua
8, Bd Niels Bohr
69622 Villeurbanne Cedex
France

Fabien Duchateau, 2011

Depuis Septembre 2011, je suis maître de conférences à l'Université Claude Bernard Lyon 1 pour l'enseignement et au LIRIS pour la partie recherche, dans l'équipe Base de Données.

En 2010, j'ai été sélectionné pour un contrat postdoctoral de 18 mois financé par l'institut de recherche européen ERCIM. La première partie de ce postdoc s'est effectuée au CWI, Pays-Bas, sous la supervision de Lynda Hardman. Ensuite, j'ai rejoint l'équipe de Trond Aalberg à NTNU, Norvège.

En Novembre 2009, j'ai obtenu mon doctorat en informatique au LIRMM, Université Montpellier II, France. Ma directrice de thèse est Zohra Bellahsene. Mon mémoire de thèse de doctorat s'intitule "Une Approche Générique pour la Sélection d'Outils de Découverte de Correspondances entre Schémas".

Mes principaux domaines de recherche sont l'intégration de données, l'alignement de schémas, d'ontologies et d'entités, le web sémantique, et l'apprentissage avec des applications aux systèmes d'informations géographiques et aux bibliothèques numériques.

Vous pouvez télécharger mon CV détaillé (actualisé en 2018).

Enseignement

Les unités d'enseignement suivantes sont détaillées sur le site du département informatique de l'UCBL.

Recherche

Publications

  • 1. Benchmarking and evaluating the interpretation of bibliographic records
    International Journal on Digital Libraries (IJDL), 2018
    Aalberg, Trond and Duchateau, Fabien and Takhirov, Naimdjon and Decourselle, Joffrey and Lumineau, Nicolas

    @ARTICLE{ijdl2018,
      author = {Aalberg, Trond and Duchateau, Fabien and Takhirov, Naimdjon and Decourselle, Joffrey and Lumineau, Nicolas},
      year = {2018},
      title = {Benchmarking and evaluating the interpretation of bibliographic records},
      journal = {International Journal on Digital Libraries (IJDL)},
      issn = {1432-5012},
      doi = {10.1007/s00799-018-0233-2},
      month = {1},
      pages = {1–23},
      url = {https://doi.org/10.1007/s00799-018-0233-2},
      notes = {Shared online version: http://rdcu.be/FSVv},
      keywords = {dataset, frbr, frbrization, migration, record interpretation},
      abstract = {In a global context which promotes the use of explicit semantics for sharing information and developing new services, the MAchine Readable Cataloguing (MARC) format that is commonly used by libraries worldwide has demonstrated its many limitations. The conceptual reference model for bibliographic information presented in the Functional Requirements for Bibliographic Records (FRBR) is expected to be the foundation for a new generation of catalogs that will replace MARC and the digital card catalog. The need for transformation of legacy MARC records to FRBR representation (FRBRization) has led to the proposal of various tools and approaches. However, these projects and the results they achieve are difficult to compare due to lack of common datasets and well defined and appropriate metrics. Our contributions fill this gap by proposing BIB-R, the first public benchmark for the FRBRization process. It is composed of two datasets that enable the identification of the strengths and weaknesses of a FRBRization tool. It also defines a set of well defined metrics that evaluate the different steps of the FRBRization process. Those resources, as well as the results of a large experiment involving three FRBRization tools tested against our benchmark, are available to the community under an open licence.},
    }

Voir mes publications sur HAL ou DBLP.

Projets

NomFinancementDatesPartenairesDescription
Home In Love (HiL)Labex IMU2017-2019CMW, GRePS, LIRIS, Home In LoveSystème de recommandation avec visualisation spatiale et non spatiale pour la recherche immobilière
DIRICKSPICS2015-2017NTNUIntégration, gardiennage et exploration de données culturelles sous forme de flux
MODALSPHC2015NTNUAlignement de connaissances: aspects distribués et passage à l'échelle
Collaboration industrielleANRT2014-2017ProgiloneThèse CIFRE "enrichissement sémantique d'entités culturelles"
UNIMAPLabex IMU2013-2016EVS, Rhône-Alpes Tourisme Intégration de services géo-localisés issus de plusieurs fournisseurs en vue d’obtenir une carte unifiée - Application aux points d’intérêts (POI) touristiques
KOGARPHC2013NTNUGestion de connaissances dans le web des données
Bonus qualité rechercheUCBL2013Gestion de la dynamicité et intégration sémantique dans un réseau de connaissances sur l’héritage culturel
Portefeuille / AdnoscoLIRIS2012-2013Gestion des données personnelles
FORUMANR2006 - 2009IRISA, LIRMM, LIRIS, LIMOS, CEMAGREFConception d'un système médiateur sémantique pour des applications gérant de grands volumes de données

Encadrements de stage et thèse de doctorat

De nombreux projets liés aussi bien à l'enseignement qu'à la recherche ne verraient pas le jour sans la participation des étudiant-e-s. Ci-dessous, les étudiant-e-s que j'ai (eu) le plaisir de (co-)encadrer.

Doctorat Joffrey Decourselle 2014 - 2018 LIRIS, France Enrichissement sémantique d'entités culturelles
Thèse de doctorat CIFRE avec l'entreprise Progilone
Bilal Berjawi 2013 - 2017 LIRIS, France Integration of Heterogeneous Data from Multiple Location-Based Services Providers: a Use Case on Tourist Points of Interest
Thèse de doctorat financée sur le projet UNIMAP
Thèse soutenue le 01/09/17 à l'INSA, Lyon
Naimdjon Takhirov 2011 - 2013 NTNU, Norvège Extracting Knowledge for Cultural Heritage Knowledge Base Population
PhD defense on 07/11/13 in NTNU, Trondheim, Norway
Master Mohamed Benaïssa 2014 LIRIS, France Enrichissement sémantique d’entités dans un contexte large échelle
Kamel Taouche 2014 LIRIS, France Optimisation et extension sémantique d'un algorithme de traitement de requêtes agrégatives
Licence Oliver Conus Automne 2013 LIRIS, France Projet KOGAR, développement d'un outil d'alignement sur un réseau pair à pair
Aurélien Chemier, Jonathan Cohen, Oliver Conus, Abdoulaye Keita, Vi-Nam Khuong, Lan Thao Le Thi, Quoc Vuong Nguyen Été 2013 UCBL, France Projet ADRess, développement d'une application de gestion de ressources

Prototypes

  • BIB-R: un benchmark pour l'évaluation des outils d'interprétation de notices bibliographiques (FRBRisation)

    BIB-R is a benchmark for the interpretation of bibliographic records. It provides two datasets (T42 and BIB-RCAT) dedicated to the evaluation of the FRBRization process. The goal T42 is to identify the weak and strong points of a tool by testing all possible issues that libraries may face during FRBRization. The second dataset BIB-RCAT is extracted from catalogs of three different cultural institutions and can be used for comparing or experimenting with the data quality and size of data that typically is found in real world catalogs. The expected FRBR results (gold standard) are included in these datasets to enable evaluation. The MARC catalogs are provided in MARC/XML format while the FRBR collections are available in RDF/XML (Generated by the Jena API).
    • 1. Open Datasets for Evaluating the Interpretation of Bibliographic Records
      Joint Conference on Digital Libraries (JCDL), 2016
      Decourselle, Joffrey and Duchateau, Fabien and Aalberg, Trond and Takhirov, Naimdjon and Lumineau, Nicolas

    • 2. BIB-R: A Benchmark for the Interpretation of Bibliographic Records
      Theory and Practice of Digital Libraries (TPDL), 2016
      Joffrey Decourselle and Fabien Duchateau and Trond Aalberg and Naimdjon Takhirov and Nicolas Lumineau


  • GeoBench: un outil d'intégration spatiale, pour construire un benchmark pour l'alignement d'entités spatiales ou simplement pour construire une carte avec des informations complètes sur ses lieux favoris
     
    GeoBench is a tool which aims at helping data integration researchers building spatial entity matching benchmarks. The main features of GeoBench is the discovery and the integration of corresponding spatial entities between different cartographic providers (currently Geonames, Here and Google Maps). It can also be used by end-users to build a customized map with complete information about their favourite restaurants, hotels, etc.
    • 1. GeoBench: a Geospatial Integration Tool for Building a Spatial Entity Matching Benchmark
      International Conference on Advances in Geographic Information Systems (SIGSPATIAL), 2014
      Anthony Morana and Thomas Morel and Bilal Berjawi and Fabien Duchateau

    • 2. PABench: Designing a Taxonomy and Implementing a Benchmark for Spatial Entity Matching
      International Conference on Advanced Geographic Information Systems, Applications, and Services (GEOProcessing), 2015
      Bilal Berjawi and Fabien Duchateau and Franck Favetta and Maryvonne Miquel and Robert Laurini


  • KIEV: un outil d'extraction de relations binaires dans des documents textuels

    KIEV (a.k.a. SPIDER) aims at extracting binary relationships from textual documents to populate semantic triple stores or large knowledge bases. It combines a semantic part (extension of labels, clustering of frequent terms) with natural language processing techniques (Part Of Speech tagging) to generate relevant patterns for a specific type of relationship. Three use cases are presented: the former discovers the type(s) of relationship between two given entities. The second use case finds all related entities given an initial entity and a type of relationship. The latter discovers examples (i.e., pairs of entities) which respect a given type of relationship.
    • 1. KIEV: a Tool for Extracting Semantic Relations from the World Wide Web
      International Conference on Extending Database Technology (EDBT), 2014
      Naimdjon Takhirov and Fabien Duchateau and Trond Aalberg and Ingeborg Solvberg

    • 2. An Integrated Approach for Large-Scale Relation Extraction from the Web
      Asia-Pacific Web Conference (APWeb), 2013
      Naimdjon Takhirov and Fabien Duchateau and Trond Aalberg and Ingeborg Solvberg

    • 3. An Evidence-based Verification Approach to Extract Entities for Knowledge Base Population
      International Semantic Web Conference (ISWC), 2012
      Naimdjon Takhirov and Fabien Duchateau and Trond Aalberg


  • Reperage: un outil de repérage urbain à travers la prise de points de repère

    Le projet "questions de repérage" a pour but d'analyser comment une personne se repère en ville. En utilisant l'interface web, un.e utilisateur.ice peut produire une carte personnalisée, représentant ses quartiers favoris et les éléments géographiques qui l'aident à s'y localiser.
    • 1. Outil de repérage urbain à travers la prise de points de repère
      Laboratoires EVS et LIRIS (technical report), 2013
      Bilal Berjawi and Maxime Colomb and Thierry Joliveau and Franck Favetta and Fabien Duchateau and Maryvonne Miquel


  • FRBRpedia: un plugin pour convertir un produit vendu sur le Web dans le modèle FRBR et en connectant les entités générées au Linked Open Data cloud (LOD)

    FRBRpedia is a tool to FRBRize Web products, i.e., to convert them into the FRBR model. This implies the detection of the artistic Work, Agents (e.g., author, translator, illustrators), Expressions and Manifestations from the product. In addition, we link the artistic Work to Linked Open Data for semantic enrichment. Our online application currently enables the FRBRization of Amazon products and the linking to DBpedia.
    • 1. FRBRPedia: a Tool for FRBRizing Web Products and Linking FRBR Entities to DBpedia
      Joint Conference on Digital Libraries (JCDL), 2011
      Fabien Duchateau and Naimdjon Takhirov and Trond Aalberg

    • 2. Supporting FRBRization of Web Product Descriptions
      Theory and Practice of Digital Libraries (TPDL), 2011
      Naimdjon Takhirov and Fabien Duchateau and Trond Aalberg

    • 3. Linking FRBR Entities to LOD through Semantic Matching
      Theory and Practice of Digital Libraries (TPDL), 2011
      Naimdjon Takhirov and Fabien Duchateau and Trond Aalberg

    • 4. FRBR-ML: A FRBR-based Framework for Semantic Interoperability
      Journal of Semantic Web, 2012
      Naimdjon Takhirov and Fabien Duchateau and Trond Aalberg and Maja Zumer


  • FRBRizer: un outil de conversion des données bibliographiques au format MARC vers le modèle FRBR

    FRBRizer (a.k.a. marc2frbr) supports the conversion of records in the MARC format to a normalized set of records based on the FRBR model. It utilizes a set of rules encoded in an XML file. An example is provided for MARC21, but the tool is generic enough and can easily be adapted to other dialects of MARC.
    • 1. FRBRPedia: a Tool for FRBRizing Web Products and Linking FRBR Entities to DBpedia
      Joint Conference on Digital Libraries (JCDL), 2011
      Fabien Duchateau and Naimdjon Takhirov and Trond Aalberg

    • 2. Supporting FRBRization of Web Product Descriptions
      Theory and Practice of Digital Libraries (TPDL), 2011
      Naimdjon Takhirov and Fabien Duchateau and Trond Aalberg

    • 3. FRBR-ML: A FRBR-based Framework for Semantic Interoperability
      Journal of Semantic Web, 2012
      Naimdjon Takhirov and Fabien Duchateau and Trond Aalberg and Maja Zumer


  • YAM: une fabrique d'outils de mise en correspondance de schémas basée sur des techniques d'apprentissage pour combiner efficacement les mesures de similarité

    YAM (Yet Another Matcher) is not (yet) another schema matching system as it enables the generation of "a la carte" schema matchers according to user requirements. These requirements include a preference for recall or precision, a training data set (schemas already matched) and provided expert mappings. YAM uses a knowledge base that includes a (possibly large) set of similarity measures and classifiers. Based on the user requirements, YAM learns how to best apply these tools (similarity measures and classifiers) in concert to achieve the best matching quality.
    • 1. YAM: A Step Forward for Generating a Dedicated Schema Matcher
      Trans. Large-Scale Data- and Knowledge-Centered Systems (TLDKS), 2016
      Fabien Duchateau and Zohra Bellahsene

    • 2. (Not) Yet Another Matcher
      Conference on Information and Knowledge Management (CIKM), 2009
      Fabien Duchateau and Remi Coletta and Zohra Bellahsene and Renée J. Miller

    • 3. YAM: a Schema Matcher Factory
      Conference on Information and Knowledge Management (CIKM), 2009
      Fabien Duchateau and Remi Coletta and Zohra Bellahsene and Renée J. Miller

    • 4. Encore un outil de découverte de correspondances entre schémas XML?
      Bases de Données Avancées (BDA), 2009
      Fabien Duchateau and Remi Coletta and Zohra Bellahsene and Renée J. Miller


  • MatchPlanner: un outil de mise en correspondance de schémas qui combine les mesures de similarité au moyen d'un arbre de décision

    MatchPlanner uses a decision tree to combine the most appropriate similarity measures for a given domain. As a first consequence of using the decision tree for matching schemas, the time performance of the system is improved since the complexity is bounded by the height of the tree. The second advantage deals with the mappings quality. Indeed, for a given domain, only the most suitable similarity measures are used. Finally, MatchPlanner is also able to learn new decision trees, thus automatically tuning the system for providing optimal configuration for a given matching scenario.
    • 1. A Flexible Approach for Planning Schema Matching Algorithms
      OTM Conferences, CooPerative Information Systems (CooPIS), 2008
      Fabien Duchateau and Zohra Bellahsene and Remi Coletta


  • XBenchMatch: un benchmark pour évaluer les outils de mise en correspondance de schémas

    XBenchMatch is a benchmark involving a set of criteria for testing and evaluating schema matching tools. We focus on the assessment of the matching tools in terms of matching quality and time performance. We also provide a testbed involving a large schema corpus that can be used by everyone to quickly benchmark their new schema matching algorithms. Finally, new metrics have been proposed to evaluate the quality of an integrated schema.
    • 1. XBenchMatch: a Benchmark for XML Schema Matching Tools
      Very Large DataBases (VLDB), 2007
      Fabien Duchateau and Zohra Bellahsene and Ela Hunt

    • 2. Measuring the Quality of an Integrated Schema
      Conference on Conceptual Modelling (ER), 2010
      Fabien Duchateau and Zohra Bellahsene

    • 3. Matching and Alignment: What is the Cost of User Post-match Effort?
      OTM Conferences, CooPerative Information Systems (CooPIS), 2011
      Fabien Duchateau and Zohra Bellahsene and Remi Coletta

    • 4. On Evaluating Schema Matching and Mapping
      Schema Matching and Mapping, 2011
      Angela Bonifati and Zohra Bellahsene and Fabien Duchateau and Yannis Velegrakis

    • 5. Designing a Benchmark for the Assessment of Schema Matching Tools
      Open Journal of Databases (OJDB), 2014
      Fabien Duchateau and Zohra Bellahsene


  • BMatch: un outil de mise en correspondance de schémas qui implémente une mesure de similarité structurelle et un arbre B- comme structure d'indexation

    BMatch has been designed to discover mappings between schemas. Its semantic aspect consists in combining both terminological and structural similarity measures. Terminological measures enable the discovery of mappings whose schema elements share similar labels. Conversely, structural measures, based on cosine measure, detects mappings when schema elements have the same neighbourhood. BMatch's second aspect aims at improving the time performance by using an indexing structure, the B-tree, to accelerate the schema matching process. Indeed, we cluster schema element's labels which share the same tokens to reduce search space during matching.
    • 1. A Context-based Measure for Discovering Approximate Semantic Matching between Schema Elements
      Research Challenges in Information Science (RCIS), 2007
      Fabien Duchateau and Zohra Bellahsene and Mathieu Roche

    • 2. An Indexing Structure for Automatic Schema Matching
      International Conference on Data Engineering (ICDE) - Workshops, 2007
      Fabien Duchateau and Zohra Bellahsene and Mark Roantree and Mathieu Roche

    • 3. BMatch: a Semantically Context-based Tool Enhanced by an Indexing Structure to Accelerate Schema Matching
      Base de Données Avancées (BDA), 2007
      Fabien Duchateau and Zohra Bellahsene and Mathieu Roche

    • 4. Improving quality and performance of schema matching in large scale
      Ingénierie des Systèmes d'Information, 2008
      Fabien Duchateau and Zohra Bellahsene and Mathieu Roche