Title
Using statistical sampling for query optimization in heterogeneous library information systems
Abstract
Query optimization in heterogeneous database systems is not always possible since the component DBMS may not have the ability to transmit necessary information. However, these systems need query optimization because the cost of transmitting large quantities of data across diverse databases is very high. We propose a query strategy which uses hypothesis testing to determine which of two sets of data are larger. Our experiments show that this strategy is very likely to select the smaller set when the sampling results fall outside a region of uncertainty we call the “grey zone.” This provides query optimization without transmission of database statistics.
Year
DOI
Venue
1993
10.1145/170791.170904
ACM Conference on Computer Science
Keywords
Field
DocType
statistical sampling,component dbms,database statistic,heterogeneous library information system,query optimization,large quantity,grey zone,hypothesis testing,query strategy,necessary information,heterogeneous database system,diverse databases,information system,hypothesis test
Information system,Query optimization,Data mining,Query expansion,Information retrieval,Computer science,Sargable,View,Web query classification,Query by Example,Online aggregation
Conference
ISBN
Citations 
PageRank 
0-89791-558-5
0
0.34
References 
Authors
4
2
Name
Order
Citations
PageRank
Gregory D. Speegle13940.20
Michael J. Donahoo213752.31