Title: A Study of Citations in Users Online Personal Collections
1A Study of Citations in Users Online Personal
Collections
- Nishikant Kapoor
- John T Butler Sean M McNee Gary C Fouty
- James A Stemper Joseph A Konstan
- GroupLens Research Group and University
Libraries
- University of Minnesota USA
2Recommenders Systems
3Recommenders Systems
4Recommenders Systems
5Recommenders Systems
6Recommenders Systems
7Recommenders Systems
8Recommenders Systems
9No Transcript
10No Transcript
11No Transcript
12No Transcript
13Research Objectives
- Design Develop
- Personalized digital library services
- Understand
- Users research interests
14Research Questions
Can we utilize users personal citation
collections to offer them personalized DL
services?
- Can citations in users personal collections be
resolved to unique identifiers?
- How many of those do actually resolve to a
unique online identifier?
- How many of the resolved citations do actually
lead to an online source for their content or
metadata?
15RefWorks
16Citation Collections
- RefWorks users
- 96 collections 30336 citations
- Two outliers 4000 and 7000
316
17Citation Types
18Citation Types
J
B
N
R
D
S
19Citation Types
W
J
B
R
N
D
20Resolvability
- A citation is resolvable if it has
- A valid unique ID DOI for articles ISBN for
books
- Enough information to resolve it to a unique ID
All citations that can be represented using a
valid unique ID are potentially resolvable.
Unique ID of identical citations in different
collections is key to building similarities
between users.
21External Resolvers
- DOI and OpenURL Query Interfaces
- Citation resolvers at crossref.org CR
- ISBN Query Interfaces
- Citation resolver at worldcat.org WC
-
22Validity
- A URL is valid if it leads to a citations
source online
- URLs URL may or may not be unique
- Validated existence of URL not its accuracy
- Did not attempt to retrieve ID for citation
23DOI Resolvability
0 0 0 0
24ISBN Resolvability
25URL Validity
26Resolvability Overlap
27Resolvability Summary
8540 47
28Limitations Concerns
- Very limited resolvers were used
- Additional resolvers such as the Citation
Matcher from PubMed could enhance resolvability
further
- Dataset too small and too diverse
- Difficult to find correlation among users
- CF based services work better with larger
dataset
- Privacy concerns
- Users want a control b anonymity
29Future Work
- Survey - Users willingness to share their
personal collections
- Understand how truly do users personal
collections represent their profile?
- Prototype of CF based DL services
- http//techlens.cs.umn.edu/
- RecSys Minneapolis Oct 19-20 2007
30Resources
- GroupLens Research Group Dept of CSEE
- University of Minnesota USA
- http//www.grouplens.org/
- MovieLens Recommender System
- http//www.movielens.org/
- TechLens Recommender System
- http//techlens.cs.umn.edu/
31Acknowledgements
- NSF grant IIS-0534939
- RefWorks http//www.refworks.com/
-
32A Study of Citations in Users Online Personal
Collections
- Nishikant Kapoor
- Questions?