Title
On-Line Speaker Enrollment using Rhythmical Voices for Human Robot Interaction
Abstract
In this study, we present a simple on-line speaker enrollment and identification among human-robot interaction (HRI) with intelligent service robots. For this purpose, speaker enrollment is performed through rhythmical singing voices or a simple game such as paper-scissors-rock. While the conventional enrollment methods frequently used in the security area should be cooperative, the proposed approach can be enrolled in a very natural way. After enrolling, the text-independent speaker recognition is accomplished by using the well-known mel-frequency cepstral coefficients (MFCC) and Gaussian mixture models (GMM). The experimental results reveal that the proposed approach yields better recognition performance in comparison to the results obtained by the conventional enrollment method.
Year
DOI
Venue
2007
10.1109/ROMAN.2007.4415208
RO-MAN
Keywords
Field
DocType
mel-frequency cepstral coefficients,intelligent service robots,online speaker enrollment,human robot interaction,rhythmical singing voices,text-independent speaker recognition,speech-based user interfaces,service robots,speech synthesis,cepstral analysis,speaker recognition,intelligent robots,speaker identification,gaussian processes,gaussian mixture models,paper-scissors-rock,gaussian mixture model,mel frequency cepstral coefficient
Mel-frequency cepstrum,Speech synthesis,Computer science,Speech recognition,Speaker recognition,Speaker diarisation,Gaussian process,Robot,Mixture model,Human–robot interaction
Conference
ISBN
Citations 
PageRank 
978-1-4244-1635-6
0
0.34
References 
Authors
5
4
Name
Order
Citations
PageRank
Kyungsook Bae100.68
Hye-jin Kim2516.18
Keun-chang Kwak336132.96
Hosub Yoon400.34