Smart City Gnosys

Smart city article details

Title	Analysis Of Rnns And Different Ml And Dl Classifiers On Speech-Based Emotion Recognition System Using Linear And Nonlinear Features
ID_Doc	9267
Authors	Jha S.; Shah S.; Ghamsani R.; Sanghavi P.; Shekokar N.M.
Year	2022
Published	Recurrent Neural Networks: Concepts and Applications
DOI	http://dx.doi.org/10.1201/9781003307822-9
Abstract	Speech recognition is an important research topic that helps understand human emotion and the semantics associated with it. The emotional state of a person is recognized using speech emotion recognition systems. These systems do not take into account the semantic aspects of the person. This task of understanding human emotions using programmed architectures like applications or websites is an active part of ongoing research. It aids in a wide array of applications ranging from human-computer interface models (HCIs) to medical diagnosis of mental health. Most of the existing research fails to classify emotions for the given dataset efficiently; moreover, the feature extraction methodology is absent in related work. In this chapter, we aim to study the linear and non linear features of human speech and its classification into different emotions like anger, sad, pleasant surprise, happy, and relaxed state. By properly distinguishing different human emotions using emotion classifiers, we aim to develop smart cities using efficient and real-time methods of emotion detection through cell phone conversations, call center applications, cabdrivers, and other machine-dependent communication sources. We realize that our proposed architecture is mainly useful in environments that are assisted by interaction devices, but our work can potentially also be applied in the medical field where the treatments or reactions to medicine can be studied. We aim to analyze different classifiers on our dataset by using classifiers such as SVM, Random Forest, RNN, MLP, Gradient Boosting Classifier, KNN, Bagging Classifier, etc. based on different algorithmic metrics. The dataset is a mixture of two different databases containing recordings from multiple actors and actresses. Filtering and analyses of different classifiers along with their confidence matrix are studied. During experimentation, the MLP classifier considerably outperforms all the other classifiers with its accuracy accompanied by neural networks that also show promising results. © Ron Waddams/Bridgeman Images.
Author Keywords

Similar Articles

Id	Similarity	Authors	Title	Published
37197	0.86	Mridha K.; Islam M.D.I.; Shorna M.M.; Priyok M.A.	Ml-Dp: A Smart Emotion Detection System For Disabled Person To Develop A Smart City	2022 10th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions), ICRITO 2022 (2022)
52744	0.859	Sora C.J.; Alkhatib M.	Speech Sentiment Analysis For Citizen'S Engagement In Smart Cities' Events	2022 7th International Conference on Smart and Sustainable Technologies, SpliTech 2022 (2022)
26100	0.859	Badr O.S.; Ibrahim N.; Elmougy A.	Fake Emotion Detection Using Affective Cues And Speech Emotion Recognition For Improved Human Computer Interaction	2023 2nd International Conference on Smart Cities 4.0, Smart Cities 4.0 2023 (2023)
5244	0.858	Costa W.; Talavera E.; Oliveira R.; Figueiredo L.; Teixeira J.M.; Lima J.P.; Teichrieb V.	A Survey On Datasets For Emotion Recognition From Vision: Limitations And In-The-Wild Applicability	Applied Sciences (Switzerland), 13, 9 (2023)