Using%20WordNet%20to%20Improve%20User%20Modelling%20in%20a%20Web%20Document%20Recommender%20System - PowerPoint PPT Presentation

About This Presentation
Title:

Using%20WordNet%20to%20Improve%20User%20Modelling%20in%20a%20Web%20Document%20Recommender%20System

Description:

Learns user's interests from the requested pages. Build a model of the user. Exploit the model to anticipate which documents in the web site could be ... – PowerPoint PPT presentation

Number of Views:83
Avg rating:3.0/5.0
Slides: 10
Provided by: dingoSbs
Category:

less

Transcript and Presenter's Notes

Title: Using%20WordNet%20to%20Improve%20User%20Modelling%20in%20a%20Web%20Document%20Recommender%20System


1
Using WordNet to Improve User Modelling in a Web
Document Recommender System
  • CS 620 Class Presentation

Bernardo Magnini and Carlo Strapparava
Presented by Haifeng He
2
Problem
  • A recommender system for a Web site of
    multilingual news
  • Learns users interests from the requested pages
  • Build a model of the user
  • Exploit the model to anticipate which documents
    in the web site could be interesting for the user

3
Previous Work
  • SiteIF, a personal agent for a multilingual news
    web site
  • Word-based (word frequency and co-occurrence)
  • Not accurate enough
  • Misinterpret word sense

4
Main Idea
  • Content-based document representation
  • Build the user model as a semantic network whose
    nodes represent sense (not just words)
  • Retrieve new documents with high semantic
    relevance with respect to the use model
  • More accurate and,
  • independent from the language of the documents
    browsed(?!).
  • The problems
  • Require a repository for word senses (WordNet)
  • Word sense disambiguation (WSD)

5
Word Domain Disambiguation
  • Sense clustering with domain labels (Magnini and
    Strapparava, 2000)
  • Each word has a domain label (MEDICINE, SPORT,
    etc)
  • Reduce the WordNet polysemy
  • Covers only noun synsets now

6
Example
7
Domain Disambiguation
  • Two steps
  • Given a word, for each domain label of the word,
    give a score, which is determined by the
    frequency of the label among the senses
  • The domain label with the highest score is
    selected
  • .83 accuracy (Magnini and Strapparava, 2000)

8
(No Transcript)
9
Evaluation and Conclusions
  • Compare the output of two systems against the
    judgments of a human advisor
  • Word-based and synset based
  • H the set of human proposals, S the set of the
    system proposals
  • Precision Recall
  • Precision increase 34. Recall increase 15.
Write a Comment
User Comments (0)
About PowerShow.com