Semantic Web research anno 2006: main streams, popular falacies, current status, future challenges - PowerPoint PPT Presentation

About This Presentation
Title:

Semantic Web research anno 2006: main streams, popular falacies, current status, future challenges

Description:

'referring to the topics of the day, of temporary interest' ... music: CDnow (2410/5), MusicMoz (1073/7) community efforts. biomedical: SNOMED (200k), GO (15k) ... – PowerPoint PPT presentation

Number of Views:121
Avg rating:3.0/5.0
Slides: 34
Provided by: frankvan4
Category:

less

Transcript and Presenter's Notes

Title: Semantic Web research anno 2006: main streams, popular falacies, current status, future challenges


1
Semantic Web research anno 2006main streams,
popular falacies, current status, future
challenges
  • Frank van Harmelen
  • Vrije Universiteit Amsterdam

2
(No Transcript)
3
This is a topical talkWebster referring
to the topics of the day, of temporary
interest
4
Which Semantic Web are we talking about?
Semantic Web research anno 2006main streams,
popular falacies, current status, future
challenges
main streams
5
General idea of Semantic Web
  • Make current web more machine accessible(currentl
    y all the intelligence is in the user)
  • Motivating use-cases
  • Search engines
  • concepts, not keywords
  • semantic narrowing/widening of queries
  • Shopbots
  • semantic interchange, not screenscraping
  • E-commerce
  • Negotiation, catalogue mapping, data-integration
  • Web Services
  • Need semantic characterisations to find them
  • Navigation
  • by semantic proximity, not hardwired links
  • .....

6
General idea of Semantic Web(2)
  • Do this by
  • Making data and meta-dataavailable on the Webin
    machine-understandable form (formalised)
  • Structure the data and meta-data in ontologies

7
machine-understandable form (What its
like to be a machine)
META-DATA
8
Expressed using the W3C stack
9
Which Semantic Web?
  • Version 1"Semantic Web as Web of Data" (TBL)
  • recipeexpose databases on the web, use RDF,
    integrate
  • meta-data from
  • expressing DB schema semantics in machine
    interpretable ways
  • enable integration and unexpected re-use

10
Which Semantic Web?
  • Version 2Enrichment of the current Web
  • recipeAnnotate, classify, index
  • meta-data from
  • automatically producing markup named-entity
    recognition, concept extraction, tagging, etc.
  • enable personalisation, search, browse,..

11
Which Semantic Web?
  • Version 1Semantic Web as Web of Data
  • Version 2Enrichment of the current Web
  • Different use-cases
  • Different techniques
  • Different users

12
Four popular falacies about the Semantic Web
Semantic Web research anno 2006main streams,
popular falacies, current status, future
challenges
popular falacies
13
First clear up some popular misunderstandings
  • False statement No ?
  • Semantic Web people try to enforce meaning from
    the top

They only enforce a language.They dont
enforce what is said in that language Compare
HTML enforced from the top,But content is
entirely free.
14
First clear up some popular misunderstandings
  • False statement No ?
  • The Semantic Web people will require everybody
    to subscribe to a single predefined "meaning" for
    the terms we use.

Of course, meaning is fluid, contextual,
etc. Lots of work on (semi)-automatically
bridging between different vocabularies.
15
First clear up some popular misunderstandings
  • False statement No ?
  • The Semantic Web will require users to
    understand the complicated details of formalised
    knowledge representation.

All of this is under the hood.
16
First clear up some popular misunderstandings
  • False statement No ?
  • The Semantic Web people will require us to
    manually markup all the existing web-pages.

Lots of work on automatically producing semantic
markup named-entity recognition, concept
extraction, etc.
17
The current state of Semantic Web
Semantic Web research anno 2006main streams,
popular falacies, current status, future
challenges
current status
18
4 hard questions on the Semantic Web
  • Q1 "where does the meta-data come from?
  • NL technology is delivering on
    concept-extraction
  • Socially emerging (learning from tagging).
  • Q2 where do the meta-data-schema come from?
  • many handcrafted schema
  • hierarchy learning remains hard
  • relation extraction remains hard.
  • Q3 what to do with many meta-data schema?
  • ontology mapping/aligning remains VERY hard.
  • Q4 wheres the Web in the Semantic Web?
  • more attention to social aspects (P2P, FOAF)
  • non-textual media remains hard
  • deal with typical Web requirements.

19
Q1 Where do the ontologies come from?
  • Professional bodies, scientific communities,
    companies, publishers, .
  • Good old fashioned Knowledge Engineering
  • Convert from DB-schema, UML, etc.
  • Learning remains very hard

20
Q1 Where do the ontologies come from?
  • handcrafted
  • music CDnow (2410/5), MusicMoz (1073/7)
  • community efforts
  • biomedical SNOMED (200k), GO (15k),
  • commercial Emtree(45k190k)
  • ranging from lightweight (Yahoo) to
    heavyweight (Cyc)
  • ranging from small (METAR) to large (UNSPC)

21
Q2 Where do the annotations come from?
  • Automated learning
  • shallow natural language analysis
  • Concept extraction

Example Encyclopedia Britannica on Amsterdam
22
Q2 Where do the annotations come from?
  • lightweight NLP
  • Dutch language semantic search engine
  • exploit existing legacy-data
  • Amazon
  • Lab equipment
  • side-effect from user interaction
  • MIT Lab photo-annotator
  • NOT from manual effort

23
Q3 What to do with many ontologies?
  • Mesh
  • Medical Subject Headings, National Library of
    Medicine
  • 22.000 descriptions
  • EMTREE
  • Commercial Elsevier, Drugs and diseases
  • 45.000 terms, 190.000 synonyms
  • UMLS
  • Integrates 100 different vocabularies
  • SNOMED
  • 200.000 concepts, College of American
    Pathologists
  • Gene Ontology
  • 15.000 terms in molecular biology
  • NCI Cancer Ontology
  • 17,000 classes (about 1M definitions),

24
Q3 What to do with many ontologies?
  • Stitching all this together by hand?

25
Q3 What to do with many ontologies?
  • Linguistics structure
  • Shared vocabulary
  • Instance-based matching
  • Shared background knowledge

26
Where are we now tools
  • Languages are stable
  • Tooling is rapidly emerging
  • HP, IBM, Oracle, Adobe,
  • Parsers,
  • Editors,
  • visualisers,
  • large scale storage and querying
  • Portal generation, search

27
Where are we now applications
  • healthy uptake in some areas
  • knowledge management / intranets
  • data-integration
  • life-sciences
  • convergence with Semantic Grid
  • cultural heritage
  • still very few applications in
  • personalisation
  • mobility/context awareness
  • Most applications for companies, few
    applications for the public

28
Future directions/challenges
Semantic Web research anno 2006main streams,
popular falacies, current status, future
challenges
future challenges
29
Semantic Web as an integrator of many different
subfields
  • Databases
  • Natural Language Processing
  • Knowledge Representation
  • Machine Learning
  • Information Retrieval
  • Agents
  • HCI
  • .

30
Provocation
  • Ontology research is done
  • We know how to make, maintain deploy them
  • We have tools methods forediting, storing,
    inferencing, visualising, etc
  • except for two problems
  • Learning
  • Mapping
  • Natural lang. technology is also done
  • at least its good enough

31
Large open questions
  • Ontology learning mapping
  • emerging semantics (social statistical)
  • Semantic Web services
  • discovery, composition realistic?
  • non-textual media
  • the semantic gap text or social?
  • Deployment
  • data-integration
  • search
  • personalisation

32
Changing focus
centralised, formalised, complete, precise
distributed, heterogeneous, open, P2P,
approximate, lightweight
Web 3.0 Web 2.0 Semantic Web
33
Predicting the future
Slide by Carol Goble

Flexible extensible Metadata schemas
RDF
Write a Comment
User Comments (0)
About PowerShow.com