Title: European Environment Agency and Linked Environment Data and how we are implementing SEIS
1European Environment AgencyandLinked
Environment Dataandhow we are implementingSEIS
2The current situation
3The current situation
4The current situation
- Find dataset
- Download it
- Import it
5The current situation
- Find dataset
- Download it
- Import it
- Clean it
6The current situation
- Find dataset
- Download it
- Import it
- Clean it
- Create chart
7Vision statement
- If SEIS is only about making data public and not
the rest, we wouldnt get much benefit! - We want to eliminate all steps but the last!
- ...And were going to use Linked Data technology
to do it
8Solution to the data format problem
- In addition to the HTML for human eyes were
asking for a new format called RDF that machines
can understand - It is a modernisation of CSV, Excel and all the
other data dump formats - This is all we ask a producer to provide... and
some metadata
9No more searching on foreign sites
- The remote nodes provide lists of their datasets
- Called manifests or semantic sitemaps
- Also in RDF format
- Controlled vocabulary URLs in metadata
- Use any of GEMET / AgroVoc / DBPedia / EuroVoc /
UMTHES we have created equivalence links
between them - The manifests are loaded into our Linked Data
search engine
10Downloading made easy!
Click on the title to see if it is in the database
11Downloading made easy
Seconds later...
12Status
- EEA has deployed two search engines called
Content Registry and Semantic Data Service that
import all lists and all data - Content Registry is for Reportnet deliveries
- Semantic Data Service is for published datasets
- We have created RDF of several data sets
Reportnet, GEMET, EUNIS, ROD, ITIS, NUTS, NACE
etc. - We can also load Eurostat SDMX data via the LATC
project
13Queries
14Example of SPARQL query
- Future prospects for the European otter
- (From Reportnet)
- PREFIX art17 lthttp//rdfdata.eionet.europa.eu/art
17/ontology/gt - PREFIX eea lthttp//rdfdata.eionet.europa.eu/eea/o
ntology/gt -
- SELECT ?country ?region ?future WHERE
- art17forSpecies lthttp//eunis.eea.europa.eu/
species/1435gt - art17hasRegionalReport ?report.
- ?report art17conclusion_future ?future
- art17forCountry ?curl
- art17region ?bgregion.
- ?bgregion eeaname ?region.
- ?curl eeaname ?country
- ORDER BY ?country ?region
15Result Future of the European otter
country region future
Austria Alpine Inadequate (U1)
Austria Continental Inadequate (U1)
Belgium Atlantic Bad (U2)
Belgium Continental Bad but improving (U2)
Czech Republic Continental Favourable (FV)
Czech Republic Pannonian Favourable (FV)
Estonia Boreal Favourable (FV)
16Comparing data Where do EUNIS and ITIS not agree
on naming?
- PREFIX e lthttp//eunis.eea.europa.eu/rdf/species-
schema.rdfgt - PREFIX itis lthttp//eunis.eea.europa.eu/rdf/schem
a.rdfgt - PREFIX dwc lthttp//rs.tdwg.org/dwc/terms/gt
-
- SELECT ?eunisname ?eunisauthor ?itisname
?itisauthor ?usage WHERE - ?eunisurl evalidName 1
- esameSynonym ?itisurl
- ebinomialName ?eunisname
- dwcscientificNameAuthorship
?eunisauthor. - ?itisurl itisnameUsage "invalid",?usage
- itiscompletename ?itisname
- itishasAuthor ?auurl.
- ?auurl itisshortAuthor ?itisauthor
-
17Results
eunisname eunisauthor itisname itisauthor usage
Chondrocladia alaskensis Lambe,1900 Chondrocladia alaskensis Lambe 1895 invalid
Myxilla parasitica (Lambe,1900) Myxilla parasitica Lambe 1893 invalid
Hymedesmia primitiva Lundbeck,1910 Hymedesmia primitiva Lundbeck 1910 invalid
Asbestopluma lycopodium (Levinsen,1886) Asbestopluma lycopodium Levinsen 1886 invalid
Esperiopsis rigida Lambe,1900 Esperiopsis rigida Lambe 1893 invalid
Cordylophora lacustris Allman, 1844 Cordylophora lacustris Allman 1844 invalid
18Visualisations
19Water use per NUTS level 2 in 2007Top 20
Combination of two Eurostat SDMX datasets
20- PREFIX qb lthttp//purl.org/linked-data/cubegt
- PREFIX e lthttp//ontologycentral.com/2009/01/euro
stat/nsgt - PREFIX sdmx-measure lthttp//purl.org/linked-data/
sdmx/2009/measuregt - PREFIX skos lthttp//www.w3.org/2004/02/skos/core
gt - PREFIX g lthttp//eurostat.linked-statistics.org/o
ntologies/geographic.rdfgt - PREFIX dataset lthttp//eurostat.linked-statistics
.org/data/gt - SELECT ?nuts2
- SUM(xsddecimal(?obsvalue)) AS ?population
- ?wateruse
- xsddecimal(?wateruse)1000000/SUM(xsddeci
mal(?obsvalue)) AS ?percapita - WHERE
- ?observation qbdataset datasetdemo_r_pjanaggr3
- etime lthttp//eurostat.linked-statistics.
org/dic/time2007gt - eage lthttp//eurostat.linked-statistics.o
rg/dic/ageTOTALgt - esex lthttp//eurostat.linked-statistics.o
rg/dic/sexTgt - egeo ?ugeo
- sdmx-measureobsValue ?obsvalue.
- ?ugeo ghasParentRegion ?parent.
21GHG per capita 1990-2009
22- PREFIX qb lthttp//purl.org/linked-data/cubegt
- PREFIX e lthttp//ontologycentral.com/2009/01/euro
stat/nsgt - PREFIX sdmx-measure lthttp//purl.org/linked-data/
sdmx/2009/measuregt - PREFIX skos lthttp//www.w3.org/2004/02/skos/core
gt - PREFIX g lthttp//eurostat.linked-statistics.org/o
ntologies/geographic.rdfgt - PREFIX dataset lthttp//eurostat.linked-statistics
.org/data/gt - SELECT ?country ?year ?population ?ghgtotal
- xsddecimal(?ghgtotal)1000/(xsddecim
al(?population)) AS ?percapita - FROM lthttp//eurostat.linked-statistics.org/data/d
emo_pjanbroad.rdfgt - FROM lthttp//eurostat.linked-statistics.org/data/e
nv_air_gge.rdfgt - FROM lthttp//semantic.eea.europa.eu/home/roug/euro
statdictionaries.rdfgt - WHERE
- ?popobs qbdataset datasetdemo_pjanbroad
- etime ?uyear
- efreq lthttp//eurostat.linked-statistics.
org/dic/freqAgt - eage lthttp//eurostat.linked-statistics.o
rg/dic/ageTOTALgt - esex lthttp//eurostat.linked-statistics.o
rg/dic/sexTgt - egeo ?ucountry
23The end
- Søren Roug
- European Environment Agency
- Soren.Roug_at_eea.europa.eu