Title
Collection of phoneme samples using time alignment and spectral stationarity of speech signals
Abstract
An automatic method for collecting a large number of phoneme samples to be used as training data for speech recognition is described. Time alignment and spectral stationarity of speech signals are used to transfer phoneme labels from a hand labeled utterance of a standard speaker to a similar utterance of another speaker for whom training data are needed. Experimental results based on speech data obtained from eight male speakers show that automatically obtained training data almost yield the same phoneme recognition accuracy as hand labeled training data.
Year
DOI
Venue
1985
10.1109/ICASSP.1985.1168214
Acoustics, Speech, and Signal Processing, IEEE International Conference ICASSP '85.
Keywords
DocType
Volume
physics,prototypes,training data,viterbi algorithm,automatic speech recognition,loudspeakers,speech recognition
Conference
10
Citations 
PageRank 
References 
0
0.34
6
Authors
2
Name
Order
Citations
PageRank
Haltsonen, S.100.34
Ruusunen, P.200.34