Querying the Web of Data

UDOS - June 30, 2017

Pierre-Antoine Champin (UCBL/LIRIS)

http://champin.net/2017/udos

Web of Data

What is the Web?

A document space

decentralized (HTTP)
interconnected (URL)
interoperable (HTML)

What is the Web?

A document space

decentralized (HTTP 2014, HTTP 2.0)
interconnected (URL, URI, IRI)
interoperable (HTML5, CSS, JS)

Limitations

How many scientists never personally received a Nobel Prize, but “produced” at least three different Nobel Prizes through their students?

<http://yasgui.org/short/B1EEbOQN->

Towards a Web of Data

A data space

decentralized (HTTP)
interconnected (URL)
interoperable (?)

Towards a Web of Data

A data space

decentralized (HTTP)
interconnected (URL)
interoperable (RDF)

RDF

Vocabulary / Ontology

A set of IRIs describing
- the categories of the domain (classes)
- the properties (attributes, relations)
- the objects of the domain

SPARQL

Principle

Describe the relationships between the data you are looking for, in the form of a subgraph.

Prefixes

PREFIX s: <http://schema.org/>
PREFIX owl: <http://www.w3.org/2002/07/owl#>
# ...

s:Event → <http://schema.org/Event>

s:startDate → <http://schema.org/startDate>

s:endDate → <http://schema.org/endDate>

owl:sameAs → <http://www.w3.org/2002/07/owl#sameAs>

...
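A prefixed name is expanded by concatenating the namespace IRI and the local name. A minimal Python sketch of that rule follows; the helper name `expand` is hypothetical and not part of any SPARQL library, and the prefix table only copies the two declarations shown above.

```python
# Prefix table copied from the two declarations shown above.
PREFIXES = {
    "s": "http://schema.org/",
    "owl": "http://www.w3.org/2002/07/owl#",
}


def expand(prefixed_name: str, prefixes: dict = PREFIXES) -> str:
    """Expand a prefixed name such as 's:Event' into a full IRI (hypothetical helper)."""
    prefix, _, local = prefixed_name.partition(":")
    if prefix not in prefixes:
        raise KeyError(f"undeclared prefix: {prefix!r}")
    # Expansion is plain string concatenation: namespace IRI + local name.
    return prefixes[prefix] + local


for name in ("s:Event", "s:startDate", "owl:sameAs"):
    print(name, "->", f"<{expand(name)}>")
```

Using a prefix that has not been declared raises an error here, much as a SPARQL engine rejects a query that uses an undeclared prefix.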

Terms

`s:Event`	Abbreviated IRI
`<http://example.org/>`	Full IRI
`"hello world"`	String literal
`42 3.14 1e-10`	Numeric literals
`?x`	Variable

Triple

Each arc of the graph is represented by a triple:

<http://champin.net/#pa> rdf:type s:Person .

Graph

The query graph is the union of its arcs.

<http://champin.net/#pa> rdf:type s:Person .
<http://champin.net/#pa> foaf:knows ?x .
?x foaf:surname "Zigmann" .
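To make the idea of matching a query graph against data more concrete, here is a toy matcher in Python. It is only a sketch: the data graph is invented for illustration, terms are plain strings, and a real SPARQL engine does far more (indexing, typed literals, query optimization).

```python
# Toy data graph: a set of (subject, predicate, object) triples.
# All data below is made up for illustration.
PA = "<http://champin.net/#pa>"
DATA = {
    (PA, "rdf:type", "s:Person"),
    (PA, "foaf:knows", "_:a"),
    (PA, "foaf:knows", "_:b"),
    ("_:a", "foaf:surname", '"Zigmann"'),
    ("_:b", "foaf:surname", '"Doe"'),
}

# The query graph from the slide, one tuple per arc.
QUERY = [
    (PA, "rdf:type", "s:Person"),
    (PA, "foaf:knows", "?x"),
    ("?x", "foaf:surname", '"Zigmann"'),
]


def is_var(term):
    return term.startswith("?")


def match_triple(pattern, triple, binding):
    """Extend `binding` so that `pattern` equals `triple`; return None on failure."""
    result = dict(binding)
    for p, t in zip(pattern, triple):
        if is_var(p):
            if result.setdefault(p, t) != t:
                return None  # variable already bound to a different term
        elif p != t:
            return None  # constant term does not match
    return result


def match_graph(patterns, data):
    """Return every binding that maps all patterns onto triples of `data`."""
    bindings = [{}]
    for pattern in patterns:
        bindings = [
            extended
            for binding in bindings
            for triple in data
            if (extended := match_triple(pattern, triple, binding)) is not None
        ]
    return bindings


print(match_graph(QUERY, DATA))  # [{'?x': '_:a'}]
```

Only `_:a` satisfies all three arcs at once, which is what "the union of the arcs" means: every arc constrains the same variables.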

Factoring (1)

Same subject:

<http://champin.net/#pa>
    rdf:type s:Person ;
    foaf:knows ?x .

Factoring (2)

Same subject and same predicate:

<http://champin.net/#pa> foaf:knows ?x, ?y .
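Both factored forms are only shorthand: they stand for the same full triples. The Python sketch below spells out that expansion; the helper `expand_factored` and its dictionary input are hypothetical conveniences, not SPARQL syntax.

```python
def expand_factored(subject, predicate_objects):
    """Expand {predicate: [objects]} for one subject into plain triples (hypothetical helper)."""
    return [
        (subject, predicate, obj)
        for predicate, objects in predicate_objects.items()  # one ';' group per predicate
        for obj in objects  # one ',' item per object
    ]


PA = "<http://champin.net/#pa>"
triples = expand_factored(PA, {
    "rdf:type": ["s:Person"],
    "foaf:knows": ["?x", "?y"],
})
for triple in triples:
    print(*triple, ".")
```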

Optional subgraph

<http://champin.net/#pa> foaf:knows ?x.
OPTIONAL { ?x s:address ?addr }

as opposed to

<http://champin.net/#pa> foaf:knows ?x.
?x s:address ?addr .
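The difference between the two forms shows up when some acquaintance has no address. The Python sketch below imitates both behaviours on invented data (`_:alice`, `_:bob` and the address value are assumptions made for the example).

```python
# Hypothetical data: who <http://champin.net/#pa> knows, and the known addresses.
known = ["_:alice", "_:bob"]
address = {"_:alice": '"Lyon"'}  # _:bob has no address in the data

# Required pattern: a solution needs BOTH triples, so _:bob is dropped.
required = [{"?x": x, "?addr": address[x]} for x in known if x in address]

# OPTIONAL pattern: every ?x is kept; ?addr is left unbound when missing.
optional = [
    {"?x": x, "?addr": address[x]} if x in address else {"?x": x}
    for x in known
]

print(required)  # one solution, for _:alice only
print(optional)  # two solutions, the second without ?addr
```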

Filters

<http://champin.net/#pa> foaf:knows ?x.
?x s:birthDate ?bd.
FILTER ( STR(?bd) > "1999-06" )
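This filter compares strings, not dates: it works because ISO 8601 dates sort chronologically when compared character by character. A small Python sketch of the same comparison, on invented birth dates:

```python
# Hypothetical solutions for ?x and ?bd (birth dates as ISO 8601 strings).
solutions = [
    {"?x": "_:a", "?bd": "1999-05-31"},
    {"?x": "_:b", "?bd": "1999-06-01"},
    {"?x": "_:c", "?bd": "2001-12-24"},
]

# Counterpart of FILTER ( STR(?bd) > "1999-06" ): plain string comparison.
kept = [s for s in solutions if s["?bd"] > "1999-06"]
print([s["?x"] for s in kept])  # ['_:b', '_:c']
```

Note that any date in June 1999 passes, since `"1999-06-01"` is a longer string that starts with `"1999-06"` and therefore compares as greater.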

Selection

SELECT ?p ?n {
    <http://champin.net/#pa> foaf:knows ?x .
    ?x foaf:givenName ?p ;
       foaf:surname ?n ;
       foaf:age ?a .

    FILTER (?a >= 18)
}

Your turn

http://champin.net/2017/udos/tuto

Going further

Finding data sources

Datahub
SPARQL endpoint status
follow your nose with VoID

Yasgui, advanced features

charts
pivot table

Federated queries

# (... prefix declarations ...)
SELECT ?s {
  [] a nobel:NobelPrize ;
     nobel:category cat:Physics ;
     nobel:year 1965 ;
     nobel:laureate ?l .
  ?l foaf:name ?name ; owl:sameAs ?iri .

  SERVICE <http://dbpedia.org/sparql> {
    ?iri dbo:doctoralStudent ?s .
  }
}

<http://yasgui.org/short/r1S4PLfE->
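A SERVICE clause asks the local engine to send part of the query to another endpoint over HTTP. The Python sketch below builds, without sending it, the kind of SPARQL Protocol request that could carry such a sub-query to `http://dbpedia.org/sparql`. The sub-query text, the placeholder laureate IRI and the `dbo:` namespace declaration are assumptions made for the example; how an engine actually splits and batches the work is implementation-specific.

```python
from urllib.parse import urlencode
from urllib.request import Request

ENDPOINT = "http://dbpedia.org/sparql"  # endpoint named in the SERVICE clause

# Hypothetical sub-query for ONE value of ?iri.
# The subject IRI is a placeholder, not a result from the Nobel dataset,
# and the dbo: namespace is assumed to be the usual DBpedia ontology one.
SUBQUERY = """
PREFIX dbo: <http://dbpedia.org/ontology/>
SELECT ?s {
  <http://example.org/some-laureate> dbo:doctoralStudent ?s .
}
"""


def build_request(endpoint: str, query: str) -> Request:
    """Build (but do not send) a SPARQL Protocol GET request."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})


request = build_request(ENDPOINT, SUBQUERY)
print(request.full_url)
# To actually run it: urllib.request.urlopen(request).read()
```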

Linked Data Fragments

http://linkeddatafragments.org/
