Smart City Gnosys

Smart city article details

Title An Investigational Analysis Of Automatic Speech Recognition On Deep Neural Networks And Gated Recurrent Unit Model
ID_Doc 8662
Authors Soundarya M.; Anusuya S.
Year 2024
Published Lecture Notes in Networks and Systems, 892
DOI http://dx.doi.org/10.1007/978-981-99-9521-9_4
Abstract For thousands of years, communication has played a crucial role in human existence, development, and globalization. Speech recognition has several uses, including biometric analysis, education, security, health care, and smart cities. Many scientists have spent years studying how machine learning may be applied to speech processing, particularly voice recognition. But in recent years, researchers have concentrated on ways to apply deep learning to problems involving human speech. In this post, we discuss our work using deep neural networks like CRNN and GRU to recognize audio samples in spoken language. Seven different classes of audio samples (Walk & footsteps, Kids speaking, Filling with water, Bass drum, Scissors, Clock, and Cough) were employed in Free Sound Datasets. Mel-spectral coefficients, along with other spectral and intensity-related factors, are among the feature parameters utilized for recognition. White noise and a retuned voice were employed as data augmentation. An average recognition rate of accuracy 93.25% and WER—Word Error Rate—of 7.84% were obtained by the GRU model, according to the findings. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.
Author Keywords And data augmentation; Deep learning; Feature extraction; Gated recurrent unit; MFCC; Speech recognition


Similar Articles


Id Similarity Authors Title Published
19236 View0.886Hattaraki S.M.; Kambalimath S.G.; Savukar B.P.; Bagali S.; Dixit U.D.; Jadhav A.S.Detection And Classification Of Various Listening Environments For Hearing-Impaired Individuals Using Crnn2024 International Conference on Innovation and Novelty in Engineering and Technology, INNOVA 2024 - Proceedings (2024)
8076 View0.865Mukhamadiyev A.; Khujayarov I.; Nabieva D.; Cho J.An Ensemble Of Convolutional Neural Networks For Sound Event DetectionMathematics, 13, 9 (2025)
17908 View0.854Ciaburro G.Deep Learning Methods For Audio Events DetectionStudies in Big Data, 82 (2021)