Title: Towards a new generation of semantic web applications
1Towards a new generation ofsemantic web
applications
- Prof. Enrico Motta, PhDKnowledge Media
InstituteThe Open UniversityMilton Keynes, UK
2The Semantic Web
- A large scale, heterogenous collection of formal,
machine processable, web accessible,
ontology-based statements (semantic metadata)
about web resources and other entities in the
world, expressed in a XML-based syntax
3The Semantic Web (pragmatic def.)
- The collection of all statements expressed in one
of the following formalismsOWL, RDF, DAML,
DAMLOIL, RDF-A, which can be accessed on the
web
4hasAffiliation
Organization
Person
Ontology
worksInOrgUnit
partOf
hasJobTitle
String
Organization-Unit
Metadata
5Ontology
Metadata
ltRDF triplegt ltRDF triplegt ltRDF triplegt ltRDF
triplegt ltRDF triplegt ltRDF triplegt
ltRDF triplegt ltRDF triplegt ltRDF triplegt ltRDF
triplegt ltRDF triplegt ltRDF triplegt
ltRDF triplegt ltRDF triplegt ltRDF triplegt ltRDF
triplegt ltRDF triplegt ltRDF triplegt
UoD
6Proposition 1
- The SW today has already reached a level of scale
good enough to make it a very useful source of
knowledge to support intelligent applications - This is unprecedented in the history of AI
7So, let's have a look at the semantic web as
it is today.
8Charting the web
9Charting the web (2)
10(No Transcript)
11(No Transcript)
12(No Transcript)
13(No Transcript)
14(No Transcript)
15(No Transcript)
16(No Transcript)
17Proposition 2
- The SW may well provide a solution to one of the
classic AI challenges how to construct and
manage large volumes of knowledge to construct
truly intelligent problem solvers and address the
brittleness of traditional KBS
181982
19Knowledge Representation Hypothesis
- Any mechanically embodied intelligent process
will be comprised of structural ingredients that - we as external observers naturally take to
represent a propositional account of the
knowledge that the overall process exhibits, and - independent of such external semantic
attribution, play a formal but causal and
essential role in engendering the behaviour that
manifests that knowledge - Brian Smith, 1982
20Intelligence as a function of possessing domain
knowledge
KA Bottleneck
Intelligent Behaviour
21The Knowledge Acquisition Bottleneck
KA Bottleneck
Intelligent Behaviour
22(No Transcript)
23SW as Enabler of Intelligent Behaviour
Intelligent Behaviour
24KBS vs SW Systems
Classic KBS SW Systems
Representation 'Clean' 'Dirty'
Size Small/Medium Extra Huge
Repr. Schema Homogeneous Heterogeneous
Quality High Very Variable
Degree of trust High Very Variable
25Key Paradigm Shift
Classic KBS SW Systems
Intelligence A function of sophisticated, task-centric problem solving A side-effect of size and heterogeneity (Collective Intelligence)
26Overall Goal
Our research programme is to contribute to the
development of this large-scale web of data and
develop a new generation of web applications able
to exploit it to provide intelligent
functionalities
27So, how can we exploit this emerging, large scale
semantic resource?
28Ontology Matching
29New paradigm use of background knowledge
Background Knowledge (external source)
R
B
A
A
B
30External Source One Ontology
- Aleksovski et al. EKAW06
- Map (anchor) terms into concepts from a richly
axiomatized domain ontology - Derive a mapping based on the relation of the
anchor terms
Assumes that a suitable (rich, large) domain
ontology (DO) is available.
31External Source Web
- van Hage et al. ISWC05
- rely on Google and an online dictionary in the
food domain to extract semantic relations between
candidate terms using IR techniques
OnlineDictionary
Does not rely on a rich Domain Ont,
IR Methods
Precision increases significantly if domain
specific sources are used 50 - Web 75 -
domain texts.
rel
A
B
32External Source SW
- Proposal
- rely on online ontologies (Semantic Web) to
derive mappings - ontologies are dynamically discovered and
combined
Semantic Web
Does not rely on any pre-selected knowledge
sources.
rel
A
B
M. Sabou, M. dAquin, E. Motta, Using the
Semantic Web as Background Knowledge inOntology
Mapping", Ontology Mapping Workshop, ISWC06.
Best Paper Award
33- How to combine online ontologies to derive
mappings?
34Strategy 1 - Definition
Find ontologies that contain equivalent classes
for A and B and use their relationship in the
ontologies to derive the mapping.
For each ontology use these rules
B1
B2
Bn
Semantic Web
An
A1
A2
O2
On
O1
These rules can be extended to take into account
indirect relations between A and B, e.g.,
between parents of A and B
rel
A
B
35Strategy 1- Examples
36Strategy 2 - Definition
Principle If no ontologies are found that
contain the two terms then combine information
from multiple ontologies to find a mapping.
Details (1) Select all ontologies containing
A equiv. with A (2) For each ontology
containing A (a) if find
relation between C and B. (b) if
find relation between C and B.
Details (1) Select all ontologies containing
A equiv. with A (2) For each ontology
containing A (a) if find
relation between C and B. (b) if
find relation between C and B.
B
rel
C
Semantic Web
rel
B
C
A
rel
A
B
37Strategy 2 - Examples
Ex1
Vs.
(r1)
(midlevel-onto)
(Tap)
(Same results for Duck, Goose, Turkey)
Vs.
Ex2
(pizza-to-go)
(r1)
(SUMO)
Vs.
Ex3
(pizza-to-go)
(r3)
(wine.owl)
38Large Scale Evaluation
Matching AGROVOC (16k terms) and NALT(41k terms)
(derived from 180 different ontologies)
Evaluation 1600 mappings, two teams Average
precision 70 (comparable to best in class)
M. Sabou, M. dAquin, W.R. van Hage, E. Motta,
Improving Ontology Matching by Dynamically
Exploring Online Knowledge.
39Chart 2
40Proposition 3
- Using the SW to provide dynamically background
knowledge to tackle the Agrovoc/NALT mapping
problem provides the first ever test case in
which the SW, viewed as a large scale
heterogeneus resource, has been successfully used
to address a real-world problem
41Next Generation Semantic Web Applications
NG SW Application
- Able to exploit the SW at large
- Dynamically retrieving the relevant semantic
resources - Combining several, heterogeneous Ontologies
42Contrast with 1st generation SW Applications
- Typically use a single ontology
- Usually providing a homogeneous view over
heterogeneous data sources. - Limited use of existing SW data
- Typically closed to semantic resources
1st generation SW applications are far more
similar to traditional KBS (closed semantic
systems) than to 'real' SW applications (open
semantic systems)
43It is still early days..
1895
2007
44Current Gateway to the Semantic Web
45Limitations of Swoogle
- Very limited quality control mechanisms
- Many ontologies are duplicated
- No quality information provided
- Limited Query/Search mechanisms
- Only keyword search no distinction between types
of elements - need for more powerful query methods (e.g.,
ability to pose formal queries ability to
distinguish between classes and instances, etc) - Limited range of ontology ranking mechanisms
- Swoogle only uses a 'popularity-based' one
- No support for ontology modularization
46A New Gateway to the Semantic Web
47Ontology Structuring Relations
inconsistent-with
extends
48Ontology Structuring Relations
inconsistent-with
Inconsistent-with
extends
49Formal Queries and relation discovery
50Current state of Watson
- Initial version implemented
- Demo version available online
- See http//watson.kmi.open.ac.uk/
- However still rather unstable..
- Stable version to be available within 4-6 weeks
- Initial crawl of the SW has already produced
interesting results.
51Some initial figures
- Lots of ontologies are in OWL FULL (3x the number
of OWL Lite) - but most of the ontologies use only a very
restricted sub-part of the expressivity of OWL
and DAML, e.g., - only 147 go beyond ALC
- role transitivity is used in only 11
ontologies.. - Almost 20 of semantic resources appear to be
duplicates
52Next Generation Semantic Web Applications
PowerMagpie
PowerAqua
53Folksonomies
Tags are great to organize data!!!
But they dont help much when searching
54Finding tagged images
55Finding tagged images FLOWER
56What if
folksonomies were semantically richer
Flower
Tulip
Lilac
Rose
57Finding tagged images FLOWER (II)
58Learning Relations Between Tags
Tags
NLP/Clustering
camera, digital, photograph damage, flooding,
hurricane, katrina, Louisiana
Clusters
Find and combine Online ontologies
Ontology
L.Specia, E. Motta, "Integrating Folksonomies
with the Semantic Web", ESWC 2007.
59In More Detail
60Examples
61Examples
62Examples
63Key Research Tasks
- Overall Infrastructure
- crawling, storing, structuring, querying the SW
- Ontology Selection
- In the context of dynamically identifying the
sources of knowledge relevant to the needs of a
system - Ontology Mapping
- When integrating information from different
ontologies - When mapping query/specs to ontologies
- Ontology Modularization
- Find the sub-modules relevant to a system's
query. - Semantic Markup Generation
- From various types of sources
64New task context
- Key point is that NG-SW applications require
solutions in a new dynamic context (run-time
rather than design-time) - Example Ontology Mapping
- Much current work focuses on design-time mapping
of complete ontologies - Example Ontology Selection
- Current work focuses on user-mediated ontology
selection - Example Ontology Modularization
- Current work by and large assumes that the user
is in the loop
65References
- Ontology Selection
- Sabou, M., Lopez, V., Motta, E. (2006). "Ontology
Selection for the Real Semantic Web How to Cover
the Queens Birthday Dinner?". Proceedings of
EKAW 2006 - Ontology Modularization
- D'Aquin, M., Sabou, M., Motta, E. (2006).
"Modularization A key for the dynamic selection
of relevant knowledge components". ISWC 2006
Workshop on Ontology Modularization - Watson
- dAquin, M., Sabou, M., Dzbor, M., Baldassarre,
C., Gridinoc, L., Angeletou, S. and Motta, E.
"WATSON A Gateway for the Semantic Web". Poster
Session at ESWC 2007
66References (2)
- Ontology Mapping
- Lopez, V., Sabou, M., Motta, E. (2006). "Mapping
the real semantic web on the fly". ISWC 2006 - Sabou, M., D'Aquin, M., Motta, E. (2006). "Using
the semantic web as background knowledge for
ontology mapping". ISWC 2006 Workshop on Ontology
Mapping. - Intg. of folksonomies and SW
- L.Specia, E. Motta, "Integrating Folksonomies
with the Semantic Web", ESWC 2007.
67'Vision' Papers
- Motta, E., Sabou, M. (2006). "Next Generation
Semantic Web Applications". 1st Asian Semantic
Web Conference, Beijing. - Motta, E., Sabou, M. (2006). "Language
Technologies and the Evolution of the Semantic
Web". LREC 2006, Genoa, Italy. - Motta, E. (2006). "Knowledge Publishing and
Access on the Semantic Web A Socio-Technological
Analysis". IEEE Intelligent Systems, Vol.21, 3,
(88-90).
68Conclusions
- SW provides an unprecedented opportunity to build
a new generation of intelligent systems, able to
exploit large scale, heterogeneous KBs - This new class of systems is fundamentally
different in many respects both from traditional
KBS and even from early SW applications - The size of the SW is increasing steadily and the
infrastructure is getting more and more robust.
These developments should enable more and more
new generation SW applications to emerge within
2-3 years
69(No Transcript)
70Current Gateway to the Semantic Web