Title
Training data selection for improving discriminative training of acoustic models
Abstract
This paper considers training data selection for discriminative training of acoustic models for broadcast news speech recognition. Three novel data selection approaches were proposed. First, the average phone accuracy over all hypothesized word sequences in the word lattice of a training utterance was utilized for utterance-level data selection. Second, phone-level data selection based on the difference between the expected accuracy of a phone arc and the average phone accuracy of the word lattice was investigated. Finally, frame-level data selection based on the normalized frame-level entropy of Gaussian posterior probabilities obtained from the word lattice was explored. The underlying characteristics of the presented approaches were extensively investigated and their performance was verified by comparison with the standard discriminative training approaches. Experiments conducted on the Mandarin broadcast news collected in Taiwan shown that both phone-and frame-level data selection could achieve slight but consistent improvements over the baseline systems at lower training iterations.
Year
DOI
Venue
2007
10.1109/ASRU.2007.4430125
ASRU
Keywords
Field
DocType
broadcast news speech recognition,speech recognition,acoustic model,utterance-level data selection,phone-level data selection,hypothesized word sequence,acoustic models,data selection,normalized frame-level entropy,discriminative training,gaussian processes,frame-level data selection,word lattice,gaussian posterior probability,entropy,probability,posterior probability
Broadcasting,Normalization (statistics),Pattern recognition,Computer science,Speech recognition,Posterior probability,Phone,Gaussian,Artificial intelligence,Gaussian process,Discriminative model,Mandarin Chinese
Conference
ISBN
Citations 
PageRank 
978-1-4244-1746-9
10
0.54
References 
Authors
14
5
Name
Order
Citations
PageRank
Shih-Hung Liu16614.53
Fang-hui Chu2302.05
Shih-Hsiang Lin314214.07
Hung-Shin Lee4539.76
Berlin Chen547937.69