Title | ||
---|---|---|
The use of monolingual context vectors for missing translations in cross-language information retrieval |
Abstract | ||
---|---|---|
For cross-language text retrieval systems that rely on bilingual dictionaries for bridging the language gap between the source query language and the target document language, good bilingual dictionary coverage is imperative. For terms with missing translations, most systems employ some approaches for expanding the existing translation dictionaries. In this paper, instead of lexicon expansion, we explore whether using the context of the unknown terms can help mitigate the loss of meaning due to missing translation. Our approaches consist of two steps: (1) to identify terms that are closely associated with the unknown source language terms as context vectors and (2) to use the translations of the associated terms in the context vectors as the surrogate translations of the unknown terms. We describe a query-independent version and a query-dependent version using such monolingual context vectors. These methods are evaluated in Japanese-to-English retrieval using the NTCIR-3 topics and data sets. Empirical results show that both methods improved CLIR performance for short and medium-length queries and that the query-dependent context vectors performed better than the query-independent versions. |
Year | DOI | Venue |
---|---|---|
2005 | 10.1007/11562214_3 | IJCNLP |
Keywords | Field | DocType |
language gap,query-independent version,cross-language information retrieval,unknown term,monolingual context vector,context vector,unknown source language term,source query language,target document language,missing translation,query-dependent context vector,query language | Query language,Language translation,Bilingual dictionary,Computer science,Multilingualism,Natural language,Lexicon,Lexico,Natural language processing,Artificial intelligence,Cross-language information retrieval | Conference |
Volume | ISSN | ISBN |
3651 | 0302-9743 | 3-540-29172-5 |
Citations | PageRank | References |
0 | 0.34 | 18 |
Authors | ||
3 |
Name | Order | Citations | PageRank |
---|---|---|---|
Yan Qu | 1 | 39 | 8.83 |
Gregory Grefenstette | 2 | 1129 | 147.00 |
David A. Evans | 3 | 841 | 147.89 |