Title
Automatic transliteration for Japanese-to-English text retrieval
Abstract
For cross language information retrieval (CLIR) based on bilingual translation dictionaries, good performance depends upon lexical coverage in the dictionary. This is especially true for languages possessing few inter-language cognates, such as between Japanese and English. In this paper, we describe a method for automatically creating and validating candidate Japanese transliterated terms of English words. A phonetic English dictionary and a set of probabilistic mapping rules are used for automatically generating transliteration candidates. A monolingual Japanese corpus is then used for automatically validating the transliterated terms. We evaluate the usage of the extracted English-Japanese transliteration pairs with Japanese to English retrieval experiments over the CLEF bilingual test collections. The use of our automatically derived extension to a bilingual translation dictionary improves average precision, both before and after pseudo-relevance feedback, with gains ranging from 2.5% to 64.8%.
Year
DOI
Venue
2003
10.1145/860435.860499
SIGIR
Keywords
DocType
ISBN
clef bilingual test collection,bilingual translation dictionary,english-japanese transliteration pair,candidate japanese transliterated term,transliterated term,english retrieval experiment,automatic transliteration,cross language information retrieval,monolingual japanese corpus,japanese-to-english text retrieval,phonetic english dictionary,english word
Conference
1-58113-646-3
Citations 
PageRank 
References 
27
1.81
10
Authors
3
Name
Order
Citations
PageRank
Yan Qu1271.81
Gregory Grefenstette21129147.00
David A. Evans3841147.89