Title | ||
---|---|---|
Using statistical sampling for query optimization in heterogeneous library information systems |
Abstract | ||
---|---|---|
Query optimization in heterogeneous database systems is not always possible since the component DBMS may not have the ability to transmit necessary information. However, these systems need query optimization because the cost of transmitting large quantities of data across diverse databases is very high. We propose a query strategy which uses hypothesis testing to determine which of two sets of data are larger. Our experiments show that this strategy is very likely to select the smaller set when the sampling results fall outside a region of uncertainty we call the “grey zone.” This provides query optimization without transmission of database statistics. |
Year | DOI | Venue |
---|---|---|
1993 | 10.1145/170791.170904 | ACM Conference on Computer Science |
Keywords | Field | DocType |
statistical sampling,component dbms,database statistic,heterogeneous library information system,query optimization,large quantity,grey zone,hypothesis testing,query strategy,necessary information,heterogeneous database system,diverse databases,information system,hypothesis test | Information system,Query optimization,Data mining,Query expansion,Information retrieval,Computer science,Sargable,View,Web query classification,Query by Example,Online aggregation | Conference |
ISBN | Citations | PageRank |
0-89791-558-5 | 0 | 0.34 |
References | Authors | |
4 | 2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Gregory D. Speegle | 1 | 39 | 40.20 |
Michael J. Donahoo | 2 | 137 | 52.31 |