Title
Training data selection for improving discriminative training of acoustic models
Abstract
This paper considers training data selection for discriminative training of acoustic models for large vocabulary continuous speech recognition (LVCSR). Three novel data selection approaches are proposed. First, the average phone accuracy over all hypothesized word sequences in the word lattice of a training utterance is utilized for utterance-level data selection. Second, phone-level data selection based on the difference between the expected accuracy of a phone arc and the average phone accuracy of the word lattice is investigated. Finally, frame-level data selection based on the normalized frame-level entropy of Gaussian posterior probabilities obtained from the word lattice is explored. The underlying characteristics of the presented approaches are extensively investigated and their performance is verified by comparison with standard discriminative training approaches. Experiments conducted on a broadcast news speech transcription task show that with the aid of phone- and frame-level data selection we can reduce more than half of the turnaround time for acoustic model training and simultaneously obtain a comparably good set of discriminative acoustic models.
Year
DOI
Venue
2009
10.1016/j.patrec.2009.05.009
Pattern Recognition Letters
Keywords
Field
DocType
word lattice,phone accuracy,phone-level data selection,discriminative training,training data selection,frame-level data selection,continuous speech recognition,standard discriminative training approach,novel data selection approach,utterance-level data selection,average phone accuracy,data selection,acoustic models,acoustic model training,entropy,posterior probability
Speech processing,Pattern recognition,Computer science,Posterior probability,Speech recognition,Phone,Artificial intelligence,Linear discriminant analysis,Vocabulary,Discriminative model,Acoustic model,Cable television
Journal
Volume
Issue
ISSN
30
13
Pattern Recognition Letters
Citations 
PageRank 
References 
13
0.66
25
Authors
3
Name
Order
Citations
PageRank
Berlin Chen147937.69
Shih-Hung Liu26614.53
Fang-hui Chu3302.05