The Perfect Search Engine Is Not Enough - PowerPoint PPT Presentation

1 / 31
About This Presentation
Title:

The Perfect Search Engine Is Not Enough

Description:

The Perfect Search Engine Is Not Enough Jaime Teevan , Christine Alvarado , Mark S. Ackerman and David R. Karger MIT, CSAIL University of Michigan – PowerPoint PPT presentation

Number of Views:93
Avg rating:3.0/5.0
Slides: 32
Provided by: Christ1122
Category:

less

Transcript and Presenter's Notes

Title: The Perfect Search Engine Is Not Enough


1
The Perfect Search Engine Is Not Enough
  • Jaime Teevan, Christine Alvarado, Mark S.
    Ackerman and David R. Karger

MIT, CSAIL University of Michigan
2
Let Me Interview You!
  • Email
  • Whats the last email you read? What did you do
    with it?
  • Have you gone back to an email youve read
    before?
  • Web
  • Whats the last Web page you visited? How did
    you get there?
  • Have you looked for anything on the Web?
  • Files
  • Whats the last file you looked at? How did you
    get to it?
  • Have you looked for a file?

3
OverviewUnderstanding
Search
Directed
  • Introduction
  • Related work
  • Methodology
  • What we learned
  • How?
  • Why?
  • Who?
  • So what?
  • Introduction
  • Related work
  • Methodology
  • What we learned
  • How?
  • Why?
  • Who?
  • So what?

4
HaystackPersonal Information Storage
Web pages
Email
Files
Calendar
Contacts
5
Directed Search in Haystack
What was that paper I read last week about
Information Retrieval?
Haystack
6
Directed Search in Haystack
Ah yes! Thank you.
Haystack
Perfect Search Engine
7
Related Work
  • Directed search
  • Lab studies Capra03, Maglio97
  • Log analysis Broder02, Spink01
  • Observational studies Malone83
  • Information Seeking
  • Marchionini, ODay and Jeffries, Bates, Belkin,
  • Evolving information need

8
Modified Diary Study
  • Subjects 15 CS graduate students
  • Ten interviews each (2/day x 5 days)
  • Two question types
  • Last email/file/Web page looked at
  • Last email/file/Web page looked for
  • Supplemented with direct observation and an
    hour-long semi-structured interview

9
OverviewUnderstanding
Directed
Search
  • Introduction
  • Related work
  • Methodology
  • What we learned
  • How?
  • Why?
  • Who?
  • So what?

10
Directed Search Today
  • Target Connie Monroes office number

? Type into a search engine Connie Monroe,
office number
11
What We Observed
Interviewer Have you looked for anything on the
Web today? Jim I had to look for the office
number of the Harvard professor.
I So how did you go about doing that? J I
went to the homepage of the Math department at
Harvard
12
What We Observed
I So you went to the Math department, and then
what did you do over there? J It had a place
where you can find people and I went to that page
and they had a dropdown list of visiting faculty,
and so I went to that link and I looked for her
name and there it was.
13
What We Observed
  • J I knew that she had a very small Web page
    saying, Im here at Harvard. Heres my contact
    information.

14
Strategies Looking for Information
Teleporting
Orienteering
15
Why Do People Orienteer?
  • Easier than saying what you want
  • You know where you are
  • You know what you find
  • The tools dont work

16
Easier Than Saying What You Want
  • Describing the target is hard
  • Cant
  • Prefer not to
  • Habit
  • Whichever way I remember first.
  • Search for source
  • E.g., Your last email search

17
You Know Where You Are
  • Stay in known space
  • URL manipulation
  • Bookmarks
  • History
  • Backtracking
  • Following an information scent
  • Never end up at a dead end

18
You Know What You Find
  • Context gives understanding of answer
  • I was looking for a specific file. But even
    when I saw its name, I wouldnt have known that
    that was the file I wanted until I saw all of the
    other names in the same directory
  • Understanding negative results
  • I basically clicked on every single button
    until I was convinced I dont think that it
    exists

19
Individual Search Behavior
  • Search behavior varied by individual
  • Categorize based on email usage
  • Filers
  • Pilers
  • People who pile information take small steps
  • People who file information take big steps

20
How Individuals Search For Files
Filers
Big steps
Pilers
Small steps
21
More to Learn from the Data
  • Differences in finding v. re-finding
  • How organization relates to search
  • Importance of type (email, files and Web)
  • Looked at v. looked for
  • ? Keep in mind population

22
Applying What We Learned
  • ? Support orienteering
  • Advantages to orienteering
  • Easier than saying what you want
  • You know where you are
  • You know what you find
  • Individual differences in step size
  • Highlight source (e.g., flag sources with info)
  • Integrate tools used for steps
  • Support exhaustive search
  • Allow for different step sizes

23
More to Learn from the Data
  • Differences in finding v. re-finding
  • How organization relates to search
  • Importance of type (email, files and Web)
  • Looked at v. looked for
  • ? Keep in mind population

24
Structural Consistency Important
All must be the same to re-find the information!
25
Preserve What User Remembers
  • Supports orienteering for re-finding
  • Allows access to new information

26
File or Pile Email
Filer
Piler
27
Searching Other Collections
Ah yes! Thank you.
28
Keep Population in Mind
  • CS grad students not representative
  • Very familiar with search tools
  • ? Would expect to see lots of tool use

29
Relating How and What
Specific General Document
Other 47 19 41
Keyword 34 23 17
  • People only keyword search 39 of the time
  • What people look for related to how they look

Orienteer to specific information
  • Surprise

30
Relating How and Corpus
Email Files Web
Other 59 42 19
Keyword 06 10 64
  • Email and files Almost never keyword searched
  • Easy to associate information with document
  • Web Used keyword search much more often

31
Relating What and Corpus
Email Files Web
Specific 39 7 33
General 10 7 30
Document 08 35 14
  • Email searches were primarily for specific
    information
  • File searches were primarily for documents
  • Web searches were more evenly distributed
Write a Comment
User Comments (0)
About PowerShow.com