Title
Investigating Data Selection For Minimum Phone Error Training Of Acoustic Models
Abstract
This paper considers minimum phone error (MPE) based discriminative training of acoustic models for Mandarin broadcast news recognition. A novel data selection approach was explored, based on the normalized frame-level entropy of Gaussian posterior probabilities obtained from the word lattice of the training utterance. Its merit is that it makes the training algorithm focus more on the training statistics of frame samples that lie near the decision boundary, for better discrimination. Moreover, we presented a new phone accuracy function based on the frame-level accuracy of hypothesized phone arcs, used in place of the raw phone accuracy function of MPE training. The underlying characteristics of the presented approaches were extensively investigated, and their performance was verified by comparison with the original MPE training approach. Experiments conducted on broadcast news collected in Taiwan showed that integrating the frame-level data selection and accuracy calculation achieved slight but consistent improvements over the baseline system.
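The two ideas named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the formulas, so normalizing the entropy by the log of the number of Gaussians, the threshold value, and the per-frame label comparison are all assumptions, and the function names `normalized_entropy`, `select_frames`, and `frame_accuracy` are hypothetical.

```python
import math


def normalized_entropy(posteriors):
    """Entropy of one frame's Gaussian posteriors, scaled to [0, 1].

    Assumption: normalize by log(N), where N is the number of Gaussians
    supplied for the frame. The paper's exact normalization may differ.
    """
    n = len(posteriors)
    total = sum(posteriors)
    if n <= 1 or total <= 0.0:
        return 0.0
    entropy = 0.0
    for p in posteriors:
        if p > 0.0:  # zero-probability terms contribute nothing
            q = p / total
            entropy -= q * math.log(q)
    return entropy / math.log(n)


def select_frames(frame_posteriors, threshold=0.5):
    """Return indices of frames whose normalized entropy reaches the threshold.

    High entropy means the posterior mass is spread over several Gaussians,
    i.e. the frame is taken to lie near the decision boundary.
    The threshold of 0.5 is an illustrative value, not one from the paper.
    """
    return [
        t
        for t, posteriors in enumerate(frame_posteriors)
        if normalized_entropy(posteriors) >= threshold
    ]


def frame_accuracy(arc_phone, start, end, ref_labels):
    """Fraction of frames in [start, end) whose reference label matches the arc.

    Assumption: a simple per-frame match rate stands in for the paper's
    frame-level phone accuracy function.
    """
    if end <= start:
        return 0.0
    hits = sum(1 for t in range(start, end) if ref_labels[t] == arc_phone)
    return hits / (end - start)
```

In this sketch, a frame with posteriors `[0.5, 0.5]` is kept and a frame with `[1.0, 0.0]` is dropped, which matches the stated goal of concentrating the training statistics on ambiguous frames.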
Year
2007
DOI
10.1109/ICME.2007.4284658
Venue
2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5
Keywords
gaussian processes,computer science,acoustical engineering,broadcasting,lattices,hidden markov models,posterior probability,speech recognition,entropy,automatic speech recognition
Field
Broadcasting,Normalization (statistics),Pattern recognition,Computer science,Posterior probability,Speech recognition,Gaussian,Phone,Gaussian process,Artificial intelligence,Decision boundary,Discriminative model
DocType
Conference
Citations
5
PageRank
0.47
References
6
Authors
4
Name              Order  Citations  PageRank
Shih-Hung Liu     1      66         14.53
Fang-hui Chu      2      30         2.05
Shih-Hsiang Lin   3      142        14.07
Berlin Chen       4      479        37.69