Abstract | ||
---|---|---|
The performance of a speaker recognition system degrades when the speaker is under stress or in an emotional state. In this paper, we explore and identify a mechanism that uses the inherent stress-in-speech, or speaking style, information present in a person's speech as additional cues for speaker recognition. We quantify the inherent stress in a speaker's speech mainly using three features, namely pitch, amplitude, and duration (together called PAD). We experimentally observe that the PAD vectors of similar phones in different words of a speaker are close to each other in the three-dimensional (PAD) space, confirming that the way a speaker stresses different syllables is unique to them. We therefore propose using the PAD-based speaking style of a speaker as an additional feature for speaker recognition applications. |
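
The closeness of PAD vectors described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `pad_distance` and `mean_pairwise_distance`, the choice of Euclidean distance, and the sample values are all assumptions made for illustration. It also assumes the three features have already been scaled to comparable ranges, since raw pitch in Hz would otherwise dominate the distance.

```python
import math
from itertools import combinations
from statistics import mean


def pad_distance(a, b):
    """Euclidean distance between two (pitch, amplitude, duration) vectors."""
    return math.dist(a, b)


def mean_pairwise_distance(vectors):
    """Average distance over all unordered pairs of PAD vectors.

    A smaller value means the vectors form a tighter cluster in PAD space.
    """
    pairs = combinations(vectors, 2)
    return mean(pad_distance(a, b) for a, b in pairs)


# Hypothetical, already-normalized PAD vectors for the same phone
# taken from three different words of one speaker (made-up values).
same_phone = [(0.52, 0.40, 0.31), (0.50, 0.43, 0.29), (0.55, 0.38, 0.33)]
print(mean_pairwise_distance(same_phone))
```

Comparing this average for one speaker's phones against the average across speakers is one simple way to check whether the vectors cluster per speaker; the abstract does not say which distance measure the paper actually uses.
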
Year | DOI | Venue |
---|---|---|
2014 | 10.1109/TENCON.2009.5396003 | Singapore |
Keywords | Field | DocType |
speaker recognition,speech processing,PAD based speaking style,PAD vectors,amplitude,duration,inherent stress,pitch,speaker recognition system,speaking style information,stress information,stress-in-speech | Speech processing,Mel-frequency cepstrum,Voice analysis,Computer science,Voice activity detection,Speaker recognition system,Speech recognition,Speaker recognition,Speaker diarisation,Speaking style | Journal |
Volume | ISSN | ISBN |
abs/1410.6905 | 0886-1420 | 978-1-4244-4547-9 |
Citations | PageRank | References |
1 | 0.35 | 7 |
Authors | ||
2 |
Name | Order | Citations | PageRank |
---|---|---|---|
Laxmi Narayana M. | 1 | 1 | 0.69 |
Sunil Kumar Kopparapu | 2 | 42 | 25.18 |