Abstract | ||
---|---|---|
In this study, we present a simple online speaker enrollment and identification method for human-robot interaction (HRI) with intelligent service robots. Speaker enrollment is performed through rhythmical singing voices or a simple game such as paper-scissors-rock. Whereas the conventional enrollment methods frequently used in the security area require the user's deliberate cooperation, the proposed approach enrolls speakers in a very natural way. After enrollment, text-independent speaker recognition is accomplished using the well-known mel-frequency cepstral coefficients (MFCC) and Gaussian mixture models (GMM). The experimental results show that the proposed approach yields better recognition performance than the conventional enrollment method. |
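
The identification step named in the abstract (scoring MFCC frames against one GMM per enrolled speaker) can be illustrated with a minimal sketch. This is not the authors' implementation: the diagonal-covariance assumption, the `(weight, mean, var)` model layout, the function names, and the toy two-dimensional "features" are all hypothetical, and MFCC extraction and GMM training are assumed to have happened elsewhere.

```python
import math


def log_gaussian_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at feature vector x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def gmm_log_likelihood(frames, gmm):
    """Sum of per-frame log-likelihoods under a GMM.

    gmm is a list of (weight, mean, var) components (hypothetical layout).
    """
    total = 0.0
    for x in frames:
        # Log-sum-exp over components keeps the computation numerically stable.
        terms = [math.log(w) + log_gaussian_diag(x, m, v) for w, m, v in gmm]
        peak = max(terms)
        total += peak + math.log(sum(math.exp(t - peak) for t in terms))
    return total


def identify_speaker(frames, speaker_models):
    """Return the enrolled speaker whose GMM scores the frames highest."""
    return max(
        speaker_models,
        key=lambda name: gmm_log_likelihood(frames, speaker_models[name]),
    )


# Toy enrolled models: two speakers, two components each, 2-D features.
# Real MFCC vectors would typically have more dimensions.
speaker_models = {
    "speaker_a": [(0.5, [0.0, 0.0], [1.0, 1.0]), (0.5, [1.0, 1.0], [1.0, 1.0])],
    "speaker_b": [(0.5, [5.0, 5.0], [1.0, 1.0]), (0.5, [6.0, 6.0], [1.0, 1.0])],
}

test_frames = [[0.2, -0.1], [0.9, 1.1], [0.4, 0.6]]
best_match = identify_speaker(test_frames, speaker_models)
```

Because the decision depends only on the likelihood of the feature frames and not on what was said, a scheme of this kind is text-independent, which is why enrollment speech from singing or a game could in principle serve as training data.
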
Year | DOI | Venue |
---|---|---|
2007 | 10.1109/ROMAN.2007.4415208 | RO-MAN |
Keywords | Field | DocType |
mel-frequency cepstral coefficients,intelligent service robots,online speaker enrollment,human robot interaction,rhythmical singing voices,text-independent speaker recognition,speech-based user interfaces,service robots,speech synthesis,cepstral analysis,speaker recognition,intelligent robots,speaker identification,gaussian processes,gaussian mixture models,paper-scissors-rock,gaussian mixture model,mel frequency cepstral coefficient | Mel-frequency cepstrum,Speech synthesis,Computer science,Speech recognition,Speaker recognition,Speaker diarisation,Gaussian process,Robot,Mixture model,Human–robot interaction | Conference |
ISBN | Citations | PageRank |
978-1-4244-1635-6 | 0 | 0.34 |
References | Authors | |
---|---|---|
5 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Kyungsook Bae | 1 | 0 | 0.68 |
Hye-jin Kim | 2 | 51 | 6.18 |
Keun-chang Kwak | 3 | 361 | 32.96 |
Hosub Yoon | 4 | 0 | 0.34 |