Smart City Gnosys

Smart city article details

Title Analysis Of Rnns And Different Ml And Dl Classifiers On Speech-Based Emotion Recognition System Using Linear And Nonlinear Features
ID_Doc 9267
Authors Jha S.; Shah S.; Ghamsani R.; Sanghavi P.; Shekokar N.M.
Year 2022
Published Recurrent Neural Networks: Concepts and Applications
DOI http://dx.doi.org/10.1201/9781003307822-9
Abstract Speech recognition is an important research topic that helps understand human emotion and the semantics associated with it. The emotional state of a person is recognized using speech emotion recognition systems. These systems do not take into account the semantic aspects of the person. This task of understanding human emotions using programmed architectures like applications or websites is an active part of ongoing research. It aids in a wide array of applications ranging from human-computer interface models (HCIs) to medical diagnosis of mental health. Most of the existing research fails to classify emotions for the given dataset efficiently; moreover, the feature extraction methodology is absent in related work. In this chapter, we aim to study the linear and non linear features of human speech and its classification into different emotions like anger, sad, pleasant surprise, happy, and relaxed state. By properly distinguishing different human emotions using emotion classifiers, we aim to develop smart cities using efficient and real-time methods of emotion detection through cell phone conversations, call center applications, cabdrivers, and other machine-dependent communication sources. We realize that our proposed architecture is mainly useful in environments that are assisted by interaction devices, but our work can potentially also be applied in the medical field where the treatments or reactions to medicine can be studied. We aim to analyze different classifiers on our dataset by using classifiers such as SVM, Random Forest, RNN, MLP, Gradient Boosting Classifier, KNN, Bagging Classifier, etc. based on different algorithmic metrics. The dataset is a mixture of two different databases containing recordings from multiple actors and actresses. Filtering and analyses of different classifiers along with their confidence matrix are studied. During experimentation, the MLP classifier considerably outperforms all the other classifiers with its accuracy accompanied by neural networks that also show promising results. © Ron Waddams/Bridgeman Images.
Author Keywords


Similar Articles


Id Similarity Authors Title Published
37197 View0.86Mridha K.; Islam M.D.I.; Shorna M.M.; Priyok M.A.Ml-Dp: A Smart Emotion Detection System For Disabled Person To Develop A Smart City2022 10th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions), ICRITO 2022 (2022)
52744 View0.859Sora C.J.; Alkhatib M.Speech Sentiment Analysis For Citizen'S Engagement In Smart Cities' Events2022 7th International Conference on Smart and Sustainable Technologies, SpliTech 2022 (2022)
26100 View0.859Badr O.S.; Ibrahim N.; Elmougy A.Fake Emotion Detection Using Affective Cues And Speech Emotion Recognition For Improved Human Computer Interaction2023 2nd International Conference on Smart Cities 4.0, Smart Cities 4.0 2023 (2023)
5244 View0.858Costa W.; Talavera E.; Oliveira R.; Figueiredo L.; Teixeira J.M.; Lima J.P.; Teichrieb V.A Survey On Datasets For Emotion Recognition From Vision: Limitations And In-The-Wild ApplicabilityApplied Sciences (Switzerland), 13, 9 (2023)