On-Line Speaking Rate Estimation Using Gaussian Mixture Models - Citegraph

Paper Info

Title
On-Line Speaking Rate Estimation Using Gaussian Mixture Models

Abstract
Gaussian Mixture Models (GMM) are a widespread tool in applications like speaker identification or verification. In contrast to Hidden Markov Models (HMM) Gaussian Mixture Models are designed to model the general properties of an underlying acoustic source. In our paper we extend the application of GMMs to the assessment of speaking rate. Directly trained on the acoustic data, they can be either applied directly to estimate the speech rate category or - with the help of a mapping function - they can provide a continuous measure for the speaking rate. The mapping function can be realized by means of a Neural Net. First experiments showed a correlation coefficient of 0.66 between the lexical phoneme rate and our estimation based on speech rate dependent spectral variation. Moreover, our approach can be used simultaneously for high accuracy on-line gender detection.

Year	DOI	Venue
2000	10.1109/ICASSP.2000.861830	2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI
Keywords	Field	DocType
parameter estimation,speaker recognition,loudspeakers,gaussian mixture model,speech,neural net,neural nets,training data,gaussian mixture models,gaussian processes,hidden markov model,hidden markov models,neural networks	Correlation coefficient,Speaker identification,Pattern recognition,Computer science,Speech recognition,Speaker recognition,Artificial intelligence,Gaussian process,Estimation theory,Hidden Markov model,Artificial neural network,Mixture model	Conference
ISSN	Citations	PageRank
1520-6149	11	0.63
References	Authors
5	3

Authors (3 rows)

Cited by (11 rows)

References (5 rows)

Name	Order	Citations	PageRank
Robert Faltlhauser	1	26	3.62
Thilo Pfau	2	113	15.74
Günther Ruske	3	154	36.13

1