Title
Enhancing Search and Browse Using Automated Clustering of Subject Metadata
Abstract
The Web puzzle of online information resources often hinders end-users from effective and efficient access to these resources. Clustering resources into appropriate subject-based groupings may help alleviate these difficulties, but will it work with heterogeneous material? The University of Michigan and the University of California Irvine joined forces to test automatically enhancing metadata records using the Topic Modeling algorithm on the varied OAIster corpus. We created labels for the resulting clusters of metadata records, matched the clusters to an in-house classification system, and developed a prototype that would showcase methods for search and retrieval using the enhanced records. Results indicated that while the algorithm was somewhat time-intensive to run and using a local classification scheme had its drawbacks, precise clustering of records was achieved and the prototype interface proved that faceted classification could be powerful in helping end-users find resources.
Year
DOI
Venue
2007
10.1045/july2007-hagedorn
D-Lib Magazine
Keywords
Field
DocType
classification system,clustering,metadata,digital libraries,algorithms
Open Archives Initiative,Metadata,World Wide Web,Information retrieval,Computer science,Classification scheme,Topic model,Digital library,Cluster analysis,Faceted classification
Journal
Volume
Issue
Citations 
13
7/8
8
PageRank 
References 
Authors
0.69
9
3
Name
Order
Citations
PageRank
Kat Hagedorn1424.90
Suzanne Chapman280.69
David Newman3131973.72