NKOS Workshop

Description:

Project report on the Semantic Portal Business and Economics, presented at the NKOS Workshop by Kai Eckert and Magnus Pfeffer.

1
Project Report: Semantic Portal Business and Economics
  • NKOS Workshop
  • September 19th 2008
  • Aarhus, Denmark

2
Project Goal
  • Creating an OPAC library search engine
  • Content
  • Library media
  • All licensed fulltext documents
  • Focus on economics
  • Modern user interface
  • Thesaurus-based search and retrieval
  • Drill-down using facets
  • Support multiple thesauri

3
Research Topics
  • Automatic indexing in the field of economics
  • Thesaurus-based user search interfaces
  • Multi-thesaurus indexing and search

4
Current Status
  • Prototype indexing system
  • Elsevier journal articles
  • STW Thesaurus
  • Collexis Search Engine
  • Datasets
  • Automatic indexing results
  • Manually indexed articles as gold standard

5
Automatic Indexing Assessment
  • Precision and recall comparison
  • Meaningless numbers on the macro level
  • Tedious on the micro level
  • Visual analysis using Semtinel
  • Per concept IC-Diff analysis
  • Treemap for navigation
  • Easy identification of critical concepts
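
The precision and recall comparison against the manually indexed gold standard can be sketched as follows. This is a minimal illustration, not the project's evaluation code: the function names and the toy documents and index terms are assumptions made up for the example. It also shows why a single macro-level average says little about which concepts cause the errors.

```python
from typing import Dict, Set, Tuple


def precision_recall(predicted: Set[str], gold: Set[str]) -> Tuple[float, float]:
    """Precision and recall of one document's automatic index terms."""
    hits = len(predicted & gold)
    precision = hits / len(predicted) if predicted else 0.0
    recall = hits / len(gold) if gold else 0.0
    return precision, recall


def macro_average(auto: Dict[str, Set[str]],
                  manual: Dict[str, Set[str]]) -> Tuple[float, float]:
    """Average the per-document scores over all gold-standard documents."""
    scores = [precision_recall(auto.get(doc, set()), gold)
              for doc, gold in manual.items()]
    if not scores:
        return 0.0, 0.0
    n = len(scores)
    return sum(p for p, _ in scores) / n, sum(r for _, r in scores) / n


# Hypothetical toy data: manual (gold) and automatic index terms per document.
manual = {
    "doc1": {"Inflation", "Monetary policy"},
    "doc2": {"Labour market", "Unemployment"},
}
auto = {
    "doc1": {"Inflation", "Theory"},
    "doc2": {"Unemployment", "Labour market", "Theory", "Economics"},
}

if __name__ == "__main__":
    for doc in manual:
        print(doc, precision_recall(auto[doc], manual[doc]))
    print("macro", macro_average(auto, manual))
```

In this toy data the overly broad term "Theory" lowers precision in both documents, but the averaged figures alone do not reveal that; finding it means inspecting documents one by one, which is the tedious micro-level work the slide refers to.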

6
IC Diff Analysis with Semtinel
7
Automatic Indexing Assessment cont.
  • Editing of example critical thesaurus concepts
  • Lack of synonyms
  • Insufficient disambiguation
  • Overly broad concepts
  • Reindexing
  • Improved Precision and recall

8
Further Steps
  • Analysis and Semtinel Tool
  • Improve framework (SKOS loader)?
  • Document based analysis methods
  • Multi-Thesaurus Retrieval
  • Multiple indexes
  • Merging multiple thesauri
  • UI Design

9
Further Steps cont.
  • Prototype retrieval system
  • Collexis engine and user interface
  • User study
  • Integration into library systems
  • Representation using RDF and DC
  • Evaluation of Ex Libris Primo product

10
Open Questions
  • How can one judge indexing results? Is our
    approach reasonable?
  • More ideas or use cases for Semtinel?
    Feature requests (e.g. an ontology editor, ...)?

11
Thank you for your attention.
kai_at_informatik.uni-mannheim.de
magnus.pfeffer_at_bib.uni-mannheim.de
12
Additional Slides
13
IC Diff Analysis
  • Information Content
  • Proposed by Resnik
  • Depends on Frequency in Document Base
  • Intrinsic Information Content
  • Proposed by Seco, Veale and Hayes
  • Based on the Number of Subconcepts

Intuitively: a value between -1 and 1 that indicates whether a concept has a
suspicious frequency given its position in the thesaurus.
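
One way such a difference could be computed is sketched below. The slides do not give the exact formulas used in Semtinel, so the details here are assumptions: the intrinsic information content follows the form published by Seco, Veale and Hayes (1 minus the log of the number of subconcepts plus one, divided by the log of the thesaurus size), the frequency-based information content is Resnik's negative log probability with add-one smoothing and scaled to the range 0 to 1, and the sign convention (intrinsic minus frequency-based) is chosen for illustration. The toy thesaurus and counts are hypothetical.

```python
import math
from typing import Dict, List, Set

# Hypothetical toy thesaurus: concept -> narrower concepts.
NARROWER: Dict[str, List[str]] = {
    "Economics": ["Labour market", "Monetary policy"],
    "Labour market": ["Unemployment", "Wages"],
    "Monetary policy": [],
    "Unemployment": [],
    "Wages": [],
}
ROOT = "Economics"

# Hypothetical number of documents indexed directly with each concept.
DIRECT_COUNTS: Dict[str, int] = {
    "Economics": 0,
    "Labour market": 1,
    "Monetary policy": 6,
    "Unemployment": 1,
    "Wages": 0,
}


def descendants(concept: str) -> Set[str]:
    """All subconcepts below a concept (transitive narrower concepts)."""
    found: Set[str] = set()
    for child in NARROWER.get(concept, []):
        found.add(child)
        found |= descendants(child)
    return found


def intrinsic_ic(concept: str) -> float:
    """Intrinsic IC: depends only on the number of subconcepts."""
    return 1.0 - math.log(len(descendants(concept)) + 1) / math.log(len(NARROWER))


def frequency(concept: str) -> int:
    """Usage count of a concept including all of its subconcepts."""
    return DIRECT_COUNTS.get(concept, 0) + sum(
        DIRECT_COUNTS.get(d, 0) for d in descendants(concept))


def corpus_ic(concept: str) -> float:
    """Resnik-style IC, smoothed and scaled to [0, 1] (assumed normalisation)."""
    return 1.0 - math.log(frequency(concept) + 1) / math.log(frequency(ROOT) + 1)


def ic_diff(concept: str) -> float:
    """Assumed convention: positive means used more often than expected."""
    return intrinsic_ic(concept) - corpus_ic(concept)


if __name__ == "__main__":
    for concept in NARROWER:
        print(f"{concept:16s} {ic_diff(concept):+.3f}")
```

Because both measures lie between 0 and 1, their difference lies between -1 and 1. In the toy data the leaf concept "Monetary policy" is used far more often than a leaf would be expected to be and gets a clearly positive value, which is the kind of concept a treemap view would highlight for inspection.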
14
Semtinel Workbench
15
Semtinel API
16
Intrinsic Information Content
17
Information Content
18
IC Diff
19
Bioscience
20
Organisms
21
Animals
22
Persons