The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output. - Citegraph

Paper Info

Title
The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output.

Abstract
We present a corpus consisting of 11,292 real-world English to Spanish automatic translations annotated with relative (ranking) and absolute (adequate/non-adequate) quality assessments. The translation requests, collected through the popular translation portal http://reverso.net, provide a most variated sample of real-world machine translation (MT) usage, from complete sentences to units of one or two words, from well-formed to hardly intelligible texts, from technical documents to colloquial and slang snippets. In this paper, we present 1) a preliminary annotation experiment that we carried out to select the most appropriate quality criterion to be used for these data, 2) a graph-based methodology inspired by Interactive Genetic Algorithms to reduce the annotation effort, and 3) the outcomes of the full-scale annotation experiment, which result in a valuable and original resource for the analysis and characterization of MT-output quality.

Year	Venue	Keywords
2012	LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION	Machine Translation,Quality Assessments,Annotated Corpus
Field	DocType	Citations
Graph,Annotation,Ranking,FAUST,Computer science,Machine translation,Technical documentation,Speech recognition,Artificial intelligence,Natural language processing,Genetic algorithm,Slang	Conference	2
PageRank	References	Authors
0.38	4	3

Authors (3 rows)

Cited by (2 rows)

References (4 rows)

Name	Order	Citations	PageRank
daniele pighin	1	289	18.72
Màrquez, Lluís	2	2149	169.81
Lluís Formiga	3	43	6.95

1