Title
On-Line Speaking Rate Estimation Using Gaussian Mixture Models
Abstract
Gaussian Mixture Models (GMM) are a widespread tool in applications like speaker identification or verification. In contrast to Hidden Markov Models (HMM) Gaussian Mixture Models are designed to model the general properties of an underlying acoustic source. In our paper we extend the application of GMMs to the assessment of speaking rate. Directly trained on the acoustic data, they can be either applied directly to estimate the speech rate category or - with the help of a mapping function - they can provide a continuous measure for the speaking rate. The mapping function can be realized by means of a Neural Net. First experiments showed a correlation coefficient of 0.66 between the lexical phoneme rate and our estimation based on speech rate dependent spectral variation. Moreover, our approach can be used simultaneously for high accuracy on-line gender detection.
Year
DOI
Venue
2000
10.1109/ICASSP.2000.861830
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI
Keywords
Field
DocType
parameter estimation,speaker recognition,loudspeakers,gaussian mixture model,speech,neural net,neural nets,training data,gaussian mixture models,gaussian processes,hidden markov model,hidden markov models,neural networks
Correlation coefficient,Speaker identification,Pattern recognition,Computer science,Speech recognition,Speaker recognition,Artificial intelligence,Gaussian process,Estimation theory,Hidden Markov model,Artificial neural network,Mixture model
Conference
ISSN
Citations 
PageRank 
1520-6149
11
0.63
References 
Authors
5
3
Name
Order
Citations
PageRank
Robert Faltlhauser1263.62
Thilo Pfau211315.74
Günther Ruske315436.13