Abstract | ||
---|---|---|
Most previous information retrieval (IR) models assume that terms of queries and documents are statistically independent from each another. However, this kind of conditional independence assumption is obviously and openly understood to be wrong, so we present a new method of incorporating term dependence in probabilistic retrieval model by adapting Bahadur-Lazarsfeld expansion (BLE) to compensate the weakness of the assumption. In this paper, we describe a theoretic process to apply BLE to the general probabilistic models and the state-of-the-art 2-Poisson model. Through the experiments on two standard document collections, HANTEC2.0 in Korean and WT10g in English, we demonstrate that incorporation of term dependences using the BLE significantly contribute to the improvement of performance in at least two different language IR systems. |
Year | DOI | Venue |
---|---|---|
2003 | 10.1016/S0306-4573(02)00078-X | Inf. Process. Manage. |
Keywords | Field | DocType |
probabilistic retrieval model,information retrieval,different language ir system,previous information retrieval,bahadur–lazarsfeld expansion,term dependence,general probabilistic model,bahadur-lazarsfeld expansion,probabilistic information retrieval model,2-poisson model,probabilistic model,standard document collection,exploring term dependence,conditional independence assumption,new method,poisson model,statistical independence,probability,conditional independence,relevance information retrieval,comparative analysis,performance | Standard (document),Divergence-from-randomness model,Information retrieval,Conditional independence,Computer science,Algorithm,Theoretical computer science,Statistical model,Relevance (information retrieval),Probabilistic logic,Probabilistic information retrieval,Independence (probability theory) | Journal |
Volume | Issue | ISSN |
39 | 4 | Information Processing and Management |
Citations | PageRank | References |
5 | 0.48 | 18 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Bong-hyun Cho | 1 | 47 | 6.37 |
Changki Lee | 2 | 279 | 26.18 |
Gary Geunbae Lee | 3 | 932 | 93.23 |