Automatic transliteration for Japanese-to-English text retrieval - Citegraph

Paper Info

Title
Automatic transliteration for Japanese-to-English text retrieval

Abstract
For cross-language information retrieval (CLIR) based on bilingual translation dictionaries, good performance depends on the dictionary's lexical coverage. This is especially true for language pairs with few cognates, such as Japanese and English. In this paper, we describe a method for automatically creating and validating candidate Japanese transliterated terms of English words. A phonetic English dictionary and a set of probabilistic mapping rules are used for automatically generating transliteration candidates. A monolingual Japanese corpus is then used for automatically validating the transliterated terms. We evaluate the extracted English-Japanese transliteration pairs in Japanese-to-English retrieval experiments over the CLEF bilingual test collections. The use of our automatically derived extension to a bilingual translation dictionary improves average precision, both before and after pseudo-relevance feedback, with gains ranging from 2.5% to 64.8%.

Year	DOI	Venue
2003	10.1145/860435.860499	SIGIR
Keywords	DocType	ISBN
clef bilingual test collection,bilingual translation dictionary,english-japanese transliteration pair,candidate japanese transliterated term,transliterated term,english retrieval experiment,automatic transliteration,cross language information retrieval,monolingual japanese corpus,japanese-to-english text retrieval,phonetic english dictionary,english word	Conference	1-58113-646-3
Citations	PageRank	References
27	1.81	10
Authors
3

Authors (3 rows)

Name	Order	Citations	PageRank
Yan Qu	1	27	1.81
Gregory Grefenstette	2	1129	147.00
David A. Evans	3	841	147.89