Title: Recuperação de Informação B (Information Retrieval B)
- Q0. What is the informative structure of a new Web site, as defined in the paper?
- Q1. What problem(s) do the authors propose to solve? How is (are) the problem(s) solved? What are the main design goals of the paper?
- Q2. The HITS (Hyperlink-Induced Topic Search) algorithm defines two values, a hub value and an authority value, for every Web page. What are the two values? Which Web pages should have a high hub value? Which Web pages should have a high authority value?
- Q3. The authors propose two mechanisms for solving the HITS problem. What are the two proposed mechanisms, and what are their design objectives?
- Q4. How is the content/meaning of Web pages represented? How is the content of linked Web pages represented?
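As background for Q2, the textbook HITS iteration can be sketched as follows. This is a minimal illustration of the standard algorithm, not the modified mechanisms proposed in the paper. The function name `hits`, the `links` dictionary format, and the fixed iteration count are assumptions made for this example.

```python
import math


def hits(links, iterations=50):
    """Textbook HITS on a small link graph (illustrative sketch only).

    links: dict mapping a page name to the list of pages it links to
           (hypothetical input format chosen for this example).
    Returns (hub, authority) dicts with L2-normalized scores.
    """
    pages = set(links) | {q for targets in links.values() for q in targets}
    hub = {p: 1.0 for p in pages}
    auth = {p: 1.0 for p in pages}

    for _ in range(iterations):
        # Authority update: sum of the hub scores of pages linking to p.
        auth = {p: 0.0 for p in pages}
        for p, targets in links.items():
            for q in targets:
                auth[q] += hub[p]

        # Hub update: sum of the authority scores of pages that p links to.
        hub = {p: sum(auth[q] for q in links.get(p, [])) for p in pages}

        # Normalize both vectors so the scores stay bounded.
        a_norm = math.sqrt(sum(v * v for v in auth.values())) or 1.0
        h_norm = math.sqrt(sum(v * v for v in hub.values())) or 1.0
        auth = {p: v / a_norm for p, v in auth.items()}
        hub = {p: v / h_norm for p, v in hub.items()}

    return hub, auth


if __name__ == "__main__":
    # One index-style page linking to three article-style pages.
    hub, auth = hits({"toc": ["a1", "a2", "a3"]})
    print(max(hub, key=hub.get))   # page with the highest hub score
    print(max(auth, key=auth.get)) # one of the pages with the highest authority
```

In this toy graph the page that links out to many others receives the hub score, and the linked pages share the authority score equally.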
- Q5. Which terms are categorized as redundant, and which terms are categorized as informative? How are redundant and informative terms determined?
- Q6. What is the content of a feature-document matrix? How are the values in the matrix calculated?
- Q7. How is the entropy of a feature (i.e., term) defined? What does a feature's entropy measure? How is the entropy of a content block defined? How is the normalized entropy of a content block measured?
- Q8. What are the entropy values of links? How is the entropy value of a link computed? How is it related to the IDF value defined in IR?
- Q9. What is intersite redundancy? What is intrasite redundancy?
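As a starting point for Q6 and Q7, the sketch below computes a Shannon entropy for one term from its row in a feature-document matrix. It rests on assumptions that should be checked against the paper: the row holds raw term counts, the weights are the counts divided by the row total, and the logarithm uses base n (the number of documents) so the result lies in [0, 1]. The names `feature_entropy` and `block_entropy`, and averaging as the block-level measure, are hypothetical choices for this example.

```python
import math


def feature_entropy(term_counts):
    """Entropy of one term over n documents (illustrative sketch).

    term_counts: occurrences of the term in each document, i.e. one row
                 of a feature-document matrix (assumed to hold raw counts).
    Uses log base n, an assumption that scales the value into [0, 1].
    """
    n = len(term_counts)
    total = sum(term_counts)
    if n < 2 or total == 0:
        return 0.0
    entropy = 0.0
    for count in term_counts:
        if count > 0:
            weight = count / total  # share of the term's occurrences in this document
            entropy -= weight * math.log(weight, n)
    return entropy


def block_entropy(feature_entropies):
    """Average entropy of the terms in a content block (assumed definition)."""
    if not feature_entropies:
        return 0.0
    return sum(feature_entropies) / len(feature_entropies)


if __name__ == "__main__":
    spread_out = feature_entropy([5, 5, 5, 5])    # term spread evenly over all documents
    concentrated = feature_entropy([7, 0, 0, 0])  # term found in a single document
    print(spread_out, concentrated)
    print(block_entropy([spread_out, concentrated]))
```

Under these assumptions, a term spread evenly over every document gets an entropy close to 1, while a term confined to one document gets 0.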