Mining the Web for Domain-Specific Translations - Citegraph

Paper Info

Title
Mining the Web for Domain-Specific Translations

Abstract
We introduce a method for learning to find domain-specific translations for a given term on the Web. In our approach, the source term is transformed into an expanded query aimed at maximizing the probability of retrieving translations from a very large collection of mixed-code documents. The method involves automatically generating sets of target- language words from training data in specific domains, automatically selecting target words for effectiveness in retrieving documents containing the sought-after translations. At run time, the given term is transformed into an expanded query and submitted to a search engine, and ranked translations are extracted from the document snippets returned by the search engine. We present a prototype, TermMine, which applies the method to a Web search engine. Evaluations over a set of domains and terms show that TermMine outperforms state-of-the-art machine translation systems.

Year	Venue	Keywords
2008	AMTA	machine translation,source term,search engine,web search engine
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
14	4

Authors (4 rows)

Cited by (0 rows)

References (14 rows)

Name	Order	Citations	PageRank
Jian-Cheng Wu	1	70	13.30
Peter Wei-Huai Hsu	2	0	0.34
Chiung-Hui Tseng	3	1	0.69
Jason S. Chang	4	345	62.64

1