Information Integration and the Semantic Web Finding knowledge, data and answers - PowerPoint PPT Presentation

1 / 57
About This Presentation
Title:

Information Integration and the Semantic Web Finding knowledge, data and answers

Description:

In an new estuary, Nile Tilapia could compete with ostracods ... Datasets, queries and results can be saved, tagged, annotated, shared, searched for, etc. ... – PowerPoint PPT presentation

Number of Views:76
Avg rating:3.0/5.0
Slides: 58
Provided by: ebiqui
Category:

less

Transcript and Presenter's Notes

Title: Information Integration and the Semantic Web Finding knowledge, data and answers


1
Information Integration and the Semantic
WebFinding knowledge, data and answers
  • Tim Finin
  • University of Maryland, Baltimore County
  • http//ebiquity.umbc.edu/resource/html/id/327/
  • Joint work with Li Ding, Anupam Joshi, Yun Peng,
    Cynthia Parr, Pranam Kolari, Pavan Reddivari,
    Sandor Dornbush, Rong Pan, Akshay Java, Joel
    Sachs, Scott Cost and Vishal Doshi

? http//creativecommons.org/licenses/by-nc-sa/2.0
/ This work was partially supported by DARPA
contract F30602-97-1-0215, NSF grants CCR007080
and IIS9875433 and grants from IBM, Fujitsu and
HP.
2
This talk
  • Motivation
  • Swoogle Semantic Websearch engine
  • Use cases and applications
  • Observations
  • Conclusions

3
Google has made us smarter
4
But what about our agents?
  • Agents still have a very minimal understanding of
    text and images.

5
But what about our agents?
  • A Google for knowledge on the Semantic Web is
    needed by software agents and programs

6
This talk
  • Motivation
  • Swoogle Semantic Websearch engine
  • Use cases and applications
  • Observations
  • Conclusions

7
  • http//swoogle.umbc.edu/
  • Running since summer 2004
  • 1.8M RDF docs, 320M triples, 10K ontologies,15K
    namespaces, 1.3M classes, 175K properties, 43M
    instances, 600 registered users

8
Swoogle Architecture
9
This talk
  • Motivation
  • Swoogle Semantic Websearch engine
  • Use cases and applications
  • Observations
  • Conclusions

10
Applications and use cases
  • Supporting Semantic Web developers
  • Ontology designers, vocabulary discovery, whos
    using my ontologies or data?, use analysis,
    errors, statistics, etc.
  • Helping scientists publish and find data
  • Spire aggregating observations and data from
    biologists
  • InferenceWeb searching over and enhancing proofs
  • SemNews Text Meaning of news stories
  • Supporting SW tools
  • Triple shop finding data for SPARQL queries

1
2
3
11
1
12
80 ontologies were found that had these three
terms
By default, ontologies are ordered by their
popularity, but they can also be ordered by
recency or size.
Lets look at this one
13
(No Transcript)
14
rdfsrange was used 41 times to assert a value.
owlObjectProperty was instantiated 28 times
timeCal defined once and used 24 times (e.g.,
as range)
15
These are the namespaces this ontology uses.
Clicking on one shows all of the documents using
the namespace.
All of this is available in RDF form for the
agents among us.
16
Heres what the agent sees. Note the swoogle and
wob (web of belief) ontologies.
17
We can also search for terms (classes,
properties) like terms for person.
18
10K terms associated with person! Ordered by
use.
Lets look at foafPersons metadata
19
(No Transcript)
20
(No Transcript)
21
(No Transcript)
22
87K documents used foafgender with a foafPerson
instance as the subject
23
3K documents used dccreator with a foafPerson
instance as the object
24
2
  • An NSF ITR collaborative project with
  • University of Maryland, Baltimore County
  • University of Maryland, College Park
  • U. Of California, Davis
  • Rocky Mountain Biological Laboratory

25
An invasive species scenario
  • Nile Tilapia fish have been found in a California
    lake.
  • Can this invasive species thrive in this
    environment?
  • If so, what will be the likelyconsequences for
    theecology?
  • Sowe need to understandthe effects of
    introducingthis fish into the food webof a
    typical California lake

26
Food Webs
  • A food web models the trophic (feeding)
    relationships between organisms in an ecology
  • Food web simulators are used to explore the
    consequences of changes in the ecology, such as
    the introduction or removal of a species
  • A locations food web is usually constructed from
    studies of the frequencies of the species found
    there and the known trophic relations among them.
  • Goal automatically construct a food web for a
    new location using existing data and knowledge
  • ELVIS Ecosystem Location Visualization and
    Information System

27
East River Valley Trophic Web
http//www.foodwebs.org/
28
Species List Constructor
  • Click a county, get a species list

29
The problem
  • We have data on what species are known to be in
    the location and can further restrict and fill in
    with other ecological models
  • But we dont know which of these the Nile Tilapia
    eats of who might eat it.
  • We can reason from taxonomic data (simlar
    species) and known natural history data (size,
    mass, habitat, etc.) to fill in the gaps.

30
(No Transcript)
31
Food Web Constructor
  • Predict food web links using database and
    taxonomic reasoning.

In an new estuary, Nile Tilapia could compete
with ostracods (green) to eat algae. Predators
(red) and prey (blue) of ostracods may be affected
32
Evidence Provider
  • Examine evidence for predicted links.

33
Status
  • Goal is ELVIS (Ecosystem Location Visualization
    and Information System) as an integrated set of
    web services for constructing food webs for a
    given location.
  • Background ontologies
  • SpireEcoConcepts concepts and properties to
    represent food webs, and ELVIS related tasks,
    inputs and outputs
  • ETHAN (Evolutionary Trees and Natural History)
    Concepts and properties for natural history
    information on species derived from data in the
    Animal diversity web and other taxonomic sources
  • Under development
  • Connect to visualization software
  • Connect to triple shop to discover more data

34
UMBC Triple Shop
3
  • http//sparql.cs.umbc.edu/
  • Online SPARQL RDF query processing with several
    interesting features
  • Automatically finds SWDs for give queries using
    Swoogle backend database
  • Datasets, queries and results can be saved,
    tagged, annotated, shared, searched for, etc.
  • RDF datasets as first class objects
  • Can be stored on our server or downloaded
  • Can be materialized in a database or(soon) as a
    Jena model

35
Who knows Anupam Joshi? Show me their names,
email address and pictures
36
The UMBC ebiquity site publishes lots of RDF
data, including FOAF profiles
37
PREFIX foaf lthttp//xmlns.com/foaf/0.1/gt SELECT
DISTINCT ?p2name ?p2mbox ?p2pix FROM ??? WHERE
?p1 foafsurname "Joshi" . ?p1
foaffirstName Anupam" . ?p1
foafmbox ?p1mbox . ?p2
foafknows ?p3 . ?p3 foafmbox
?p1mbox . ?p2 foafname ?p2name
. ?p2 foafmbox ?p2mbox .
OPTIONAL ?p2 foafdepiction ?p2pix .
ORDER BY ?p2name
38
(No Transcript)
39
(No Transcript)
40
(No Transcript)
41
302 RDF documents were found that might have
useful data.
42
Well select them all and add them to the current
dataset.
43
Well run the query against this dataset to see
if the results are as expected.
44
The results can be produced in any of several
formats
45
(No Transcript)
46
Looks like a useful dataset. Lets save it and
also materialize it the TS triple store.
47
(No Transcript)
48
We can also annotate, save and share queries.
49
Work in Progress
  • There are a host of performance issues
  • We plan on supporting some special datasets,
    e.g.,
  • FOAF data collected from Swoogle
  • Definitions of RDF and OWL classes and properties
    from all ontologies that Swoogle has discovered
  • Expanding constraints to select candidate SWDs to
    include arbitrary metadata and embedded queries
  • FROM documents trusted by a member of the SPIRE
    project
  • We will explore two models for making this useful
  • As a downloadable application for client machines
  • As an (open source?) downloadable service for
    servers supporting a community of users.

50
This talk
  • Motivation
  • Swoogle Semantic Websearch engine
  • Use cases and applications
  • Observations
  • Conclusions

51
Will Swoogle Scale? How?
  • Heres a rough estimate of the data in RDF
    documents on the semantic web based on Swoogles
    crawling

System/date Terms Documents Individuals Triples Bytes
Swoogle2 1.5x105 3.5x105 7x106 5x107 7x109
Swoogle3 2x105 7x105 1.5x107 7.5x107 1x1010
2006 1x106 5x107 5x107 5x109 5x1011
2008 5x106 5x109 5x109 5x1011 5x1013
We think Swoogles centralized approach can be
made to work for the next few years if not longer.
52
How much reasoning should Swoogle do?
  • SwoogleN (Nlt3) does limited reasoning
  • Its expensive
  • Its not clear how much should be done
  • More reasoning would benefit many use cases
  • e.g., type hierarchy
  • Recognizing specialized metadata
  • E.g., that ontology A some maps terms from B to C

53
A RDF Dictionary
  • We hope to develop an RDF dictionary.
  • Given an RDF term, returns a graph of its
    definiton
  • Term ? definition from official ontology
  • TermURL ? definition from SWD at URL
  • Term ? union definition
  • Optional argument recursively adds definitions of
    terms in definition excluding RDFS and OWL terms
  • Optional arguments identifies more namespaces to
    exclude

54
This talk
  • Motivation
  • Swoogle Semantic Websearch engine
  • Use cases and applications
  • Observations
  • Conclusions

55
Conclusion
  • The web will contain the worlds knowledge in
    forms accessible to people and computers
  • We need better ways to discover, index, search
    and reason over SW knowledge
  • SW search engines address different tasks than
    html search engines
  • So they require different techniques and APIs
  • Swoogle like systems can help create consensus
    ontologies and foster best practices
  • Swoogle is for Semantic Web 1.0
  • Semantic Web 2.0 will make different demands

56
For more information
http//ebiquity.umbc.edu/
Annotatedin OWL
57
Backup
Write a Comment
User Comments (0)
About PowerShow.com