Smart City Gnosys

Smart city article details

Title Classification Of Environmental And Urban Sounds Using Deep Learning Techniques
ID_Doc 14284
Authors Reddy B.S.; Chowdary D.M.; Srinivas R.; Rahmani M.O.
Year 2025
Published 4th IEEE International Conference on Distributed Computing and Electrical Circuits and Electronics, ICDCECE 2025
DOI http://dx.doi.org/10.1109/ICDCECE65353.2025.11036020
Abstract Environmental sound classification (ESC) is an important task in machine learning, with broad applications ranging from urban monitoring to wildlife protection and smart cities. By enabling machines to recognize and categorize sounds from the environment, ESC systems help automate sound detection and recognition. ESC systems often struggle with noise and the diverse nature of sounds, existing methods rely on complex feature extraction processes sometimes including multiple intermediate stages. This paper proposes a transfer learning approach using the Audio Spectrogram Transformer (AST), an attention-based transformer network for audio classification. Proposed method is to combine it with a Sequential block containing LSTM, Linear, and Dropout layers. Unlike other approaches that involve multiple feature extractions, AST can directly convert the spectrogram images of audio into its features, similar to the Vision-Transformer (ViT). The model was evaluated on ESC-10, ESC-50, and UrbanSound8k datasets. The results confirm that combining transformers and LSTM shows better results in sound classification. © 2025 IEEE.
Author Keywords Attention; Environmental Sound Classification; LSTM; Neural Network; Spectrogram; Transformer


Similar Articles


Id Similarity Authors Title Published
24259 View0.896Das J.K.; Chakrabarty A.; Piran M.J.Environmental Sound Classification Using Convolution Neural Networks With Different Integrated Loss FunctionsExpert Systems, 39, 5 (2022)
33493 View0.896Özseven T.Investigation Of The Effectiveness Of Time-Frequency Domain Images And Acoustic Features In Urban Sound ClassificationApplied Acoustics, 211 (2023)
24260 View0.895Seresht H.R.; Mohammadi K.Environmental Sound Classification With Low-Complexity Convolutional Neural Network Empowered By Sparse Salient Region PoolingIEEE Access, 11 (2023)
14555 View0.892Seker H.; Inik O.Cnnsound: Convolutional Neural Networks For The Classification Of Environmental SoundsACM International Conference Proceeding Series (2020)
14548 View0.889İnik Ö.Cnn Hyper-Parameter Optimization For Environmental Sound ClassificationApplied Acoustics, 202 (2023)
24672 View0.886Lamrini M.; Chkouri M.Y.; Touhafi A.Evaluating The Performance Of Pre-Trained Convolutional Neural Network For Audio Classification On Embedded Systems For Anomaly Detection In Smart CitiesSensors, 23, 13 (2023)
7700 View0.881Zhang D.; Zhong Z.; Xia Y.; Wang Z.; Xiong W.An Automatic Classification System For Environmental Sound In Smart CitiesSensors, 23, 15 (2023)
48795 View0.874Liu Z.; Yeh W.-C.Simplified Swarm Optimisation For Cnn Hyperparameters: A Sound Classification ApproachInternational Journal of Web and Grid Services, 20, 1 (2024)
24258 View0.872Cui Q.; Zhao K.; Wang L.; Gao K.; Cao F.; Wang X.Environmental Sound Classification Based On Knowledge DistillationInternational Conference on Signal Processing Proceedings, ICSP, 2022-October (2022)
52318 View0.87Nogueira A.F.R.; Oliveira H.S.; Machado J.J.M.; Tavares J.M.R.S.Sound Classification And Processing Of Urban Environments: A Systematic Literature ReviewSensors, 22, 22 (2022)