An Expectation Maximization algorithm for textual unit alignment - Citegraph

Paper Info

Title
An Expectation Maximization algorithm for textual unit alignment

Abstract
The paper presents an Expectation Maximization (EM) algorithm for automatic generation of parallel and quasi-parallel data from any degree of comparable corpora ranging from parallel to weakly comparable. Specifically, we address the problem of extracting related textual units (documents, paragraphs or sentences) relying on the hypothesis that, in a given corpus, certain pairs of translation equivalents are better indicators of a correct textual unit correspondence than other pairs of translation equivalents. We evaluate our method on mixed types of bilingual comparable corpora in six language pairs, obtaining state of the art accuracy figures.

Year	Venue	Keywords
2011	BUCC@ACL	comparable corpus,bilingual comparable corpus,expectation maximization algorithm,better indicator,automatic generation,textual unit,certain pair,textual unit alignment,translation equivalent,art accuracy figure,expectation maximization,correct textual unit correspondence
Field	DocType	Citations
Computer science,Expectation–maximization algorithm,Speech recognition,Ranging,Natural language processing,Artificial intelligence	Conference	7
PageRank	References	Authors
0.59	7	3

Authors (3 rows)

Cited by (7 rows)

References (7 rows)

Name	Order	Citations	PageRank
Radu Ion	1	163	22.33
Alexandru Ceauşu	2	70	9.36
Elena Irimia	3	24	6.76

1