Title
GlOSS: text-source discovery over the Internet
Abstract
The dramatic growth of the Internet has created a new problem for users: location of the relevant sources of documents. This article presents a framework for (and experimentally analyzes a solution to) this problem, which we call the text-source discovery problem. Our approach consists of two phases. First, each text source exports its contents to a centralized service. Second, users present queries to the service, which returns an ordered list of promising text sources. This article describes GlOSS, Glossary of Servers Server, with two versions: bGlOSS, which provides a Boolean query retrieval model, and vGlOSS, which provides a vector-space retrieval model. We also present hGlOSS, which provides a decentralized version of the system. We extensively describe the methodology for measuring the retrieval effectiveness of these systems and provide experimental evidence, based on actual data, that all three systems are highly effective in determining promising text sources for a given query.
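The two-phase approach in the abstract can be illustrated with a minimal sketch. Everything here is hypothetical: the `SourceSummary` record, the function names, and the term-independence estimate for conjunctive queries are assumptions made for illustration. The abstract does not specify what a source exports or how the service scores sources.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class SourceSummary:
    """Hypothetical summary that a text source exports in phase one."""
    name: str
    num_docs: int
    doc_freq: Dict[str, int]  # term -> number of documents containing it


def estimate_matches(summary: SourceSummary, terms: List[str]) -> float:
    """Estimate how many documents match an AND query over `terms`.

    Assumes terms occur independently across documents. This is an
    illustrative assumption, not something stated in the abstract.
    """
    if summary.num_docs == 0:
        return 0.0
    estimate = float(summary.num_docs)
    for term in terms:
        estimate *= summary.doc_freq.get(term, 0) / summary.num_docs
    return estimate


def rank_sources(
    summaries: List[SourceSummary], terms: List[str]
) -> List[Tuple[str, float]]:
    """Phase two: return sources ordered by estimated matches, best first."""
    scored = [(s.name, estimate_matches(s, terms)) for s in summaries]
    # Drop sources whose estimate is zero; they are not promising.
    scored = [pair for pair in scored if pair[1] > 0]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    summaries = [
        SourceSummary("A", 100, {"database": 50, "retrieval": 20}),
        SourceSummary("B", 1000, {"database": 100, "retrieval": 500}),
        SourceSummary("C", 10, {"database": 5}),
    ]
    for name, score in rank_sources(summaries, ["database", "retrieval"]):
        print(name, round(score, 2))
```

The service only needs per-term document counts from each source, not the documents themselves, which is what keeps the exported summaries small in this sketch.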
Year
1999
DOI
10.1145/320248.320252
Venue
ACM Trans. Database Syst.
Keywords
vector-space retrieval model, present hGlOSS, text-source discovery problem, centralized service, retrieval effectiveness, users present query, new problem, promising text source, Boolean query retrieval model, text source
DocType
Journal
Volume
24
Issue
2
ISSN
0362-5915
Citations
191
PageRank
48.79
References
26
Authors
3
Name, Order, Citations and PageRank
L. Gravano, 1, 5668855.47
Héctor García-Molina, 2, 243595652.13
Anthony Tomasic, 3, 430242.23