Title | ||
---|---|---|
A speech emotion recognition framework based on latent Dirichlet allocation: Algorithm and FPGA implementation |
Abstract | ||
---|---|---|
In this paper, we present a speech-based emotion recognition framework based on a latent Dirichlet allocation model. This method assumes that incoming speech frames are conditionally independent and exchangeable. While this leads to a loss of temporal structure, it is able to capture significant statistical information between frames. In contrast, a hidden Markov model-based approach captures the temporal structure in speech. Using the German emotional speech database EMO-DB for evaluation, we achieve an average classification accuracy of 80.7% compared to 73% for hidden Markov models. This improvement is achieved at the cost of a slight increase in computational complexity. We map the proposed algorithm onto an FPGA platform and show that emotions in a speech utterance of duration 1.5s can be identified in 1.8ms, while utilizing 70% of the resources. This further demonstrates the suitability of our approach for real-time applications on hand-held devices. |
Year | DOI | Venue |
---|---|---|
2013 | 10.1109/ICASSP.2013.6638116 | Acoustics, Speech and Signal Processing |
Keywords | Field | DocType |
computational complexity,emotion recognition,field programmable gate arrays,hidden Markov models,speech recognition,EMO-DB,FPGA platform,German emotional speech database,computational complexity,hand-held devices,hidden Markov models,latent Dirichlet allocation model,speech emotion recognition framework,speech utterance,statistical information,temporal structure,time 1.5 s,time 1.8 ms,FPGA implementation,affective computing,emotion recognition,latent Dirichlet allocation | Latent Dirichlet allocation,Computer science,Utterance,Speaker recognition,Artificial intelligence,Pattern recognition,Conditional independence,Field-programmable gate array,Algorithm,Speech recognition,Affective computing,Hidden Markov model,Computational complexity theory | Conference |
ISSN | Citations | PageRank |
1520-6149 | 6 | 0.47 |
References | Authors | |
11 | 4 |
Name | Order | Citations | PageRank |
---|---|---|---|
Mohit Shah | 1 | 27 | 3.11 |
Lifeng Miao | 2 | 23 | 3.43 |
Chaitali Chakrabarti | 3 | 1978 | 184.17 |
Andreas S. Spanias | 4 | 528 | 87.90 |