{Ontology: Resource} x {Matching : Mapping} x {Schema : Instance} :: Components of the same challenge? - PowerPoint PPT Presentation

1 / 40
About This Presentation
Title:

{Ontology: Resource} x {Matching : Mapping} x {Schema : Instance} :: Components of the same challenge?

Description:

Invited Talk, International Workshop on Ontology Matching collocated with the 5th International Semantic Web Conference, ISWC-2006, November 5, 2006, Athens GA http ... – PowerPoint PPT presentation

Number of Views:44
Avg rating:3.0/5.0
Slides: 41
Provided by: AmitShet
Category:

less

Transcript and Presenter's Notes

Title: {Ontology: Resource} x {Matching : Mapping} x {Schema : Instance} :: Components of the same challenge?


1
Ontology Resource x Matching Mapping x
Schema Instance Components of the same
challenge?
  • Invited Talk, International Workshop on Ontology
    Matching
  • collocated with the 5th International Semantic
    Web Conference
  • ISWC-2006, November 5, 2006, Athens GA
  • Professor Amit Sheth
  • Special Thanks Meena Nagarajan
  • Acknowledgment SemDis project, funded by NSF

2
Information System needs and Ontology Matching
goals
SemDis, ISIS
3
Information systems - From mediators to
information brokering
  • Mediators between heterogeneous information
    sources
  • InfoHarness, VisualHarness, InfoSleuth, SIMS,
    Garlic etc.

Circa 1992-1996.
4
Information systems - From mediators to
information brokers
  • Information brokers
  • InfoQuilt, OBSERVER etc.

Circa 1996-2000
5
Need for querying across multiple ontologies
OBSERVER
Circa 1994, 1996-2002
6
Ontology Matching goals
  • Goals of ontology matching (and mapping, or
    integration)
  • Shallow analysis to identify dependencies for
    integration
  • Deeper analysis to create mappings for query
    based transformations / integration
  • Integrate schemas to create a global schema
  • Integrate instance bases

Sheth, Review of a real world experience in
database schema integration (Bellcore, ca. 1993)
7
Ontology Matching changing notions
  • Given the distributed nature of modeling domains
    and metadata, the need for matching advanced to
    Information Integration
  • Now
  • Query processing not limited to multiple
    databases or ontologies, but multiple domains and
    sources of information
  • Exploiting structured, semi-structured and
    unstructured data sources, multi-model Web sources

8
The process of Ontology Matching
  • Different for purposes of merging / aligning
    ontologies
  • Type of relationships that suffice to be
    discovered are limited to equivalence / inclusion
    / disjointness / overlap mappings
  • Different for purposes of information integration
    to analytics to discovery
  • Need for discovering more Complex mappings
  • Named relationships / associations
  • Graph based / numerical mappings

9
Top down and bottom up view to ontology matching
  • Top Down schema instance integration to
    provide information integration
  • Top Down schema instance integration to
    provide information integration

10
Top down and bottom up view to ontology matching
  • Bottom up exploit external data sources to drive
    schema matching

11
A step backDB vs. Ontology - Fundamental
differences
12
Schema integration goals DB vs. Ontology
  • DB schema integration goal
  • Defining an integrated view of the data for all
    applications using the data.
  • Ontology schema integration goal
  • Defining an agreement between multiple ontology
    schemas modeled for the same domain.

13
Goals are different because of differences in
  • The modeling paradigms
  • A database schema is a model for the data that
    one more applications intend to use.
  • An ontology is a model of knowledge for a bounded
    region of interest (also known as a domain)
  • Data vs. Knowledge A DB instance base is not
    the same as an ontology instance base
  • A database models data to be used by one or more
    applications
  • An ontology models knowledge about a domain,
    independent of the application

14
Modeling Database vs. Ontology schemas -
Fundamental differences
Axis of comparison Database schemas Ontology schemas
Modeling perspective Intended to model data being used by one or more applications Intended to model a domain
Structure vs. Semantics Emphasis while modeling is on structure of the tables Emphasis while modeling is on the semantics of the domain emphasis on relationships, also facts/knowledge/ground truth
15
Agreement Limited to a syntactic agreement between applications using the data Symbolizes agreement of the modeling of a domain possibly used by applications in varying contexts.
Instance metadata modeling / expressiveness Limited expressivity in capturing instance level metadata due to static schemas More expressive modeling paradigm
Context of modeling Well defined by applications using the data Modeling of a domain irrespective of applications
Choice of modeling affects the possible space of
heterogeneities and therefore the process of
matching.
In both cases however, the schema is only an
abstraction of the real world the real
power/semantics lies at the instance level.
16
The space of heterogeneities in DB schema
integration
  • Conflicts/Heterogeneities in DB schema
    integration
  • Model / representation relational vs. network
    vs. hierarchical models
  • Structural / schematic
  • Domain Incompatibilities
  • Entity Definition Incompatibilities
  • Data Value Incompatibilities
  • Abstraction level Incompatibilities
  • Largely syntactic and structural relatively few
    semantic conflicts

Sheth/Kashyap 1992, Kim/Seo 1993, Kashyap/Sheth
1996)
17
The space of heterogeneities in ontology schema
integration
  • Conflicts/Heterogeneities in ontology schema
    integration
  • Significant conflicts in perception of a domain
    semantic conflicts
  • Other heterogeneities are similar to those in the
    DB world
  • Model / representation OWL/RDF topic maps
    etc.
  • Structural modeling as an entity vs. an
    attribute/property generalization vs.
    abstraction etc.
  • Largely semantic conflicts comparable syntactic
    conflicts

18
Key Observations
  • There are significant philosophical differences
    in how a DB schema and an Ontology schema are
    modeled
  • In spite of these distinctions, many schema
    matching techniques overlap significantly.
  • Have we advanced the state of art in ontology
    schema matching?

19
Schema Integration DB vs. OntologyHave we
advanced the state of art ?
20
Schema Integration techniques used
Schema matching techniques
Information exploited
DB
Ontology
Schema level
  • Syntactic
  • Linguistic Matching names, descriptions,
    namespaces etc.
  • Constraint-based Constraint matches on data
    types, value ranges, uniqueness, cardinalities
    etc.
  • Matching Table and column level names and
    constraints
  • Matching class, properties/ relationship,
    attribute level names and constraints

21
Schema Integration techniques used
Schema matching techniques
Information exploited
DB
Ontology
Schema level
  • Structural
  • Constraint-based Tree / Graph structure matching
  • Matching structures of relational tables
  • Matching class hierarchies and structures

22
Schema Integration techniques used
Schema matching techniques
Information exploited
DB
Ontology
Instance level
  • Linguistic
  • IR techniques, word frequencies, key terms,
    combination of key terms etc.
  • Constraint based
  • Numerical value patterns, ranges useful for
    recognizing phone numbers etc.
  • Hybrid approaches use a combination of all
    techniques

23
Discovered semantic relationships
  • State of the art in DBs and Ontologies
  • Relationships with set semantics overlap /
    disjointness / exclusion / equivalence /
    subsumption
  • Their logical encodings are what they mean
  • Of more interest is discovering arbitrary named
    relationships
  • Relationships such as works_for or causes have
    real-world semantics. Their encoding in first
    order logic lacks semantic grounding.
  • Matching and mapping closely tied. Ability to
    capture complex mapping (e.g., semantic
    proximity) puts significantly different demand on
    matching

24
Key Observation
  • DB and Ontology schema matching techniques
    overlap significantly
  • Not much advancement since DB schema integration
    efforts
  • Ontologies formalize the semantics of a domain,
    but matching is still primarily syntactic /
    structural.
  • The semantics of named relationships is largely
    unexploited
  • The real semantics lies in the relationships
    connecting entities
  • Modeled as first class objects in Ontologies
  • In DB, they are not explicit and have to be
    inferred

25
(Complex) named relationships and Ontology
Matching
26
(Complex) named relationships - example
AFFECTS
27
Discovering such (complex) named relationships
  • Matching techniques have exhausted Schema
    Instance properties
  • Ontology modeling de couples schema instance
    base
  • Tremendous opportunity to exploit knowledge
    present outside the ontology knowledge base
    (External structured, semi-structured and
    unstructured data sources)

28
Knowledge discovery and validation
Rele-vant docs
Query and update
PubMedetc.
29
A Vision for Ontology Matching Discovering
simple to complex matches from schema,
instances and corpus
Possible identifiable matches equivalence /
inclusion / overlap / disjointness
SIMPLE TO COMPLEX MATCHES
Possible to identify more complex relationships
from the corpus.
30
Corpus based schema matching
31
The Intuition
UMLS
Biologically active substance
affects
complicates
causes
causes
Disease or Syndrome
instance_of
instance_of
???????
Raynauds Disease
Fish Oils
MeSH
9284 documents
PubMed
4733 documents
5 documents
32
The Method Identify entities and Relationships
in Parse Tree
33
Key Observation
  • What is interesting is not the entity estrogen
    or endometrium
  • The real knowledge lies in the complex and
    modified entities an excessive endogeneous
    stimulation by estrogen

Current KR frameworks do not model this.
Capturing this might affect the way we think of
matching and mapping.
34
Converting candidate relationships to ontology
matches
  • Linguistic and statistical challenges
  • Variations of entities, relationships and
    associations
  • Translating instance level findings to the schema
    level
  • GOING FROM several discovered relationships like
    Deficiency in migraine causes Migraine TO
    substance X causes condition Y

35
Discovery vs. Validation of relationships two
sides of the coin
  • Discovering complex relationships from text is a
    hard problem
  • Natural Language challenges (not all sentences
    are well formed)
  • Validating complex relationships / hypothesis is
    relatively simpler

36
Corpus based Hypothesis validation
Does magnesium alleviate effects of migraine in
patients? One possible hypothesized connection
between magnesium and migraine.
PubMed
37
From matching to mappings several challenges
  • Mappings are not always simple mathematical /
    string transformations
  • Examples of complex mappings
  • Associations / paths between classes
  • Graph based / form fitting functions

Number of earthquakes with magnitude gt 7 almost
constant. So if at all, then nuclear tests only
cause earthquakes with magnitude lt 7
38
The take home message
39
A world beyond simple matches and mappings
  • The distinction between schema and instances is
    slowly disappearing
  • Integrating new and external data sources, mining
    and analyzing them is gaining importance.
  • Tremendous opportunities and challenges in using
    more information than what is modeled in a schema
    and captured in an instance base.

Need to go beyond well-mannered schemas and
knowledge representations and relatively
simpler mappings
40
For more information
  • LSDIS Lab http//lsdis.cs.uga.edu
  • Kno.e.sis Center http//www.knoesis.org
Write a Comment
User Comments (0)
About PowerShow.com