Title
Distributed information management in the National HPCC Software Exchange
Abstract
Currently, most approaches to retrieving textual materials from scientific databases depend on a lexical match between words in usersý requests and those in or assigned to documents in a database. Because of the tremendous diversity in the words people use to describe the same document, lexical methods are necessarily incomplete and imprecise. Using the singular value decomposition (SVD), one can take advantage of the implicit higher-order structure in the association of terms with documents by determining the SVD of large sparse term by document matrices. Terms and documents represented by 200-300 of the largest singular vectors are then matched against user queries. We call this retrieval method Latent Semantic Indexing (LSI) because the subspace represents important associative relationships between terms and documents that are not evident in individual documents. LSI is a completely automatic yet intelligent indexing method, widely applicable, and a promising way to improve usersý access to many kinds of textual materials, or to documents and services for which textual descriptions are available. A survey of the computational requirements for managing LSI-encoded databases as well as current and future applications of LSI is presented.
Year
DOI
Venue
1995
10.1145/224170.224211
SC
Keywords
Field
DocType
HPCC,high performance computing,information management,information retrieval,software repository,HPCC,high performance computing,information management,information retrieval,software repository
Data science,Information management,Software repository,Computer science,Collaborative software,Search engine indexing,Software,Concurrent computing,Database,Access network,Home page,Distributed computing
Conference
ISSN
ISBN
Citations 
1063-9535
0-89791-816-9
0
PageRank 
References 
Authors
0.34
2
8
Name
Order
Citations
PageRank
Shirley Browne14225.93
Jack J. Dongarra2176252615.79
Geoffrey Fox34070575.38
Ken Hawick4215.58
Ken Kennedy53659663.70
Rick Stevens600.34
Robert Olson750838.89
Tom Rowan8244.65