How (Not) to Use a Semi-automated Clustering Tool

About This Presentation

Slides: 9

Provided by: KatHag8

Learn more at: https://apps.lib.umich.edu

Transcript and Presenter's Notes

1
How (Not) to Use a Semi-automated Clustering Tool

2
Update on UM's efforts

3
The need to cluster

Want to offer more than search within a large, generic corpus of data
How to partition the data?
Emory's MetaCombine tool is promising as a topical clustering agent
(Also interested in clustering by format, access restriction, OAI software used, etc.)

4
Clustering vs. classification

5
Results: duration

6
Results: cluster names

7
Caveats

8
What we need

Running the tool locally, with a local WSDL instance, would save lots (and lots) of time
Better set names: does this mean a better algorithm?
Ability to cluster by any criterion, not just topic, i.e., a post-processing module
Disjunctive clustering, meaning (so as not to hog storage) filename (not file) clustering
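
As a rough illustration of the last two wishes, here is a minimal Python sketch of a post-processing step that groups records by an arbitrary criterion and stores only filenames in each cluster, not copies of the files. It is not part of MetaCombine. The record fields ("filename", "format", "access") and the helper names cluster_filenames and by_field are hypothetical, chosen only for this example.

```python
from collections import defaultdict
from typing import Callable, Iterable, Mapping

# A harvested record reduced to a few metadata fields (hypothetical layout).
Record = Mapping[str, str]


def cluster_filenames(
    records: Iterable[Record],
    criterion: Callable[[Record], Iterable[str]],
) -> dict[str, list[str]]:
    """Group records under every label the criterion returns.

    Only the filename is stored in each cluster, so a record that
    belongs to several clusters costs one short string per cluster
    rather than a copy of the file.
    """
    clusters: dict[str, list[str]] = defaultdict(list)
    for record in records:
        for label in criterion(record):
            clusters[label].append(record["filename"])
    return dict(clusters)


def by_field(field: str) -> Callable[[Record], list[str]]:
    """Build a criterion that clusters on a single metadata field."""
    def criterion(record: Record) -> list[str]:
        value = record.get(field)
        # Records missing the field are collected under a placeholder label.
        return [value] if value else ["(unknown)"]
    return criterion


# Made-up sample records, for illustration only.
records = [
    {"filename": "rec001.xml", "format": "image", "access": "open"},
    {"filename": "rec002.xml", "format": "text", "access": "restricted"},
    {"filename": "rec003.xml", "format": "text", "access": "open"},
]

by_format = cluster_filenames(records, by_field("format"))
by_access = cluster_filenames(records, by_field("access"))
```

The same records can be re-clustered by any other field, or by a criterion that returns several labels per record, without touching the files themselves.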
