Minimum risk acoustic clustering for multilingual acoustic model combination - Citegraph

Paper Info

Title
Minimum risk acoustic clustering for multilingual acoustic model combination

Abstract
In this paper we describe procedures for combining multiple acoustic models, obtained using training corpora from different languages, in order to improve ASR performance in languages for which large amounts of training data are not available. We treat these models as multiple sources of information whose scores are combined in a log-linear model to compute the hypothesis likelihood. The model combination can either be performed in a static way, with constant combination weights, or in a dynamic way, with parameters that can vary for different segments of a hy- pothesis. The aim is to optimize the parameters so as to achieve minimum word error rate. In order to achieve robust parameter estimation in the dynamic combination case, the parameters are defined to be piecewise constant on different phonetic classes that form a partition of the space of hypothesis segments. The parti- tion is defined, using phonological knowledge, on segments that correspond to hypothesized phones. We examine different ways to define such a partition, including an automatic approach that gives a binary tree structured partition which tries to achieve the minimum WER with the minimum number of classes.

Year	Venue	Keywords
2000	INTERSPEECH	word error rate,log linear model,binary tree,parameter estimation
Field	DocType	Citations
Pattern recognition,Computer science,Speech recognition,Artificial intelligence,Cluster analysis,Acoustic model	Conference	6
PageRank	References	Authors
0.85	6	3

Authors (3 rows)

Cited by (6 rows)

References (6 rows)

Name	Order	Citations	PageRank
Dimitra Vergyri	1	373	36.97
Stavros Tsakalidis	2	213	13.83
William Byrne	3	245	33.80

1