Recuperao de Informao B - PowerPoint PPT Presentation

1 / 2
About This Presentation
Title:

Recuperao de Informao B

Description:

How are the content/meaning of Web pages represented? ... What is the content of a feature-document matrix? ... How is the entropy of a content block defined? ... – PowerPoint PPT presentation

Number of Views:119
Avg rating:3.0/5.0
Slides: 3
Provided by: bert194
Category:

less

Transcript and Presenter's Notes

Title: Recuperao de Informao B


1
Q0. What is the informative structure of a new
Web site as defined in the paper? Q1. What
problem(s) is (are) the authors proposed to
solve? How is (are) the problem(s) solved?
What are the main design goals of the paper?
Q2. The HITS (Hyperlink Induced Topics Search)
algorithm defines two values, hub and authority
values, of all Web pages. What are the two
values? Which Web pages should have high hub
value? Which Web pages should have high
authority value? Q3. The authors propose two
mechanisms in solving the HITS problem. What are
the two proposed mechanisms and what are the
design objectives of the two proposed
mechanisms? Q4. How are the content/meaning of
Web pages represented? How are the content
of linked Web pages represented?
2
  • Q5. Which terms are categorized as redundant,
    and which terms are categorized as informative?
  • How are redundant and informative terms
    determined?
  • Q6. What is the content of a feature-document
    matrix? How are the values in the matrix
    calculated?
  • Q7. How is the entropy of a feature (i.e., term)
    defined? What does a feature entropy measure?
  • How is the entropy of a content block
    defined? How is a normalized entropy of a
    content block measured?
  • Q8. What are the entropy values of links? How
    is an entropy value of a link computed? How is
    it related to the IDF value defined in IR?
  • Q9. What is intersite redundancy? What is
    intrasite redundancy?
Write a Comment
User Comments (0)
About PowerShow.com