Title
A Perspective Study on Speech Emotion Recognition: Databases, Features and Classification Models
Abstract
Automatic Speech Recognition (ASR) is a popular research area with many variations in human behaviour functionalities and interactions. Human beings want speech for communication and Conversations. When the conversation is going on, the information or message of the speech utterances is transferred. It also consists of message which includes speaker's traits like emotion, his or her physiological characteristics and environmental statistics. There is a tremendous number of signals or records that are complex and encoded, but these can be decoded quickly because of human intelligence. Many academics in the domain of Human Computer Interaction (HCI) are working to automate speech generation and the extraction of speech attributes and meaning. For example, ASR can regulate the usage of voice command and maintain dictation discipline while also recognizing and verifying the speech of the speaker. As a result of accent and nativity traits, the speaker's emotional state can be discerned from the speech. In this Paper, we discussed Speech Production System of Human, Research Problems in Speech Processing, SER system Motivation, Challenges and Objectives of Speech Emotion Recognition, so far the work done on Telugu Speech Emotion Databases and their role thoroughly explained. In this Paper, our own Created Database i.e., (DETL) Database for Emotions in Telugu Language and the software Audacity for creating that database is discussed clearly.
Year
DOI
Venue
2021
10.18280/ts.380631
TRAITEMENT DU SIGNAL
Keywords
DocType
Volume
ASR, HCI, SER, Telugu emotional speech, acoustic, SVM, MLP, CNN
Journal
38
Issue
ISSN
Citations 
6
0765-0019
0
PageRank 
References 
Authors
0.34
0
2
Name
Order
Citations
PageRank
Kogila Raghu100.34
Manchala Sadanandam200.34