Title
Minimum risk acoustic clustering for multilingual acoustic model combination
Abstract
In this paper we describe procedures for combining multiple acoustic models, obtained using training corpora from different languages, in order to improve ASR performance in languages for which large amounts of training data are not available. We treat these models as multiple sources of information whose scores are combined in a log-linear model to compute the hypothesis likelihood. The model combination can either be performed in a static way, with constant combination weights, or in a dynamic way, with parameters that can vary for different segments of a hy- pothesis. The aim is to optimize the parameters so as to achieve minimum word error rate. In order to achieve robust parameter estimation in the dynamic combination case, the parameters are defined to be piecewise constant on different phonetic classes that form a partition of the space of hypothesis segments. The parti- tion is defined, using phonological knowledge, on segments that correspond to hypothesized phones. We examine different ways to define such a partition, including an automatic approach that gives a binary tree structured partition which tries to achieve the minimum WER with the minimum number of classes.
Year
Venue
Keywords
2000
INTERSPEECH
word error rate,log linear model,binary tree,parameter estimation
Field
DocType
Citations 
Pattern recognition,Computer science,Speech recognition,Artificial intelligence,Cluster analysis,Acoustic model
Conference
6
PageRank 
References 
Authors
0.85
6
3
Name
Order
Citations
PageRank
Dimitra Vergyri137336.97
Stavros Tsakalidis221313.83
William Byrne324533.80