Smart City Gnosys

Smart city article details

Title Environmental Sound Classification Using Convolution Neural Networks With Different Integrated Loss Functions
ID_Doc 24259
Authors Das J.K.; Chakrabarty A.; Piran M.J.
Year 2022
Published Expert Systems, 39, 5
DOI http://dx.doi.org/10.1111/exsy.12804
Abstract The hike in the demand for smart cities has gathered the interest of researchers to work on environmental sound classification. Most researchers' goal is to reach the Bayesian optimal error in the field of audio classification. Nonetheless, it is very baffling to interpret meaning from a three-dimensional audio and this is where different types of spectrograms become effective. Using benchmark spectral features such as mel frequency cepstral coefficients (MFCCs), chromagram, log-mel spectrogram (LM), and so on audio can be converted into meaningful 2D spectrograms. In this paper, we propose a convolutional neural network (CNN) model, which is fabricated with additive angular margin loss (AAML), large margin cosine loss (LMCL) and a-softmax loss. These loss functions proposed for face recognition, hold their value in the other fields of study if they are implemented in a systematic manner. The mentioned loss functions are more dominant than conventional softmax loss when it comes to classification task because of its capability to increase intra-class compactness and inter-class discrepancy. Thus, with MCAAM-Net, MCAS-Net and MCLCM-Net models, a classification accuracy of 99.60%, 99.43% and 99.37% is achieved on UrbanSound8K dataset respectively without any augmentation. This paper also demonstrates the benefit of stacking features together and the above-mentioned validation accuracies are achieved after stacking MFCCs and chromagram on the x-axis. We also visualized the clusters formed by the embedded vectors of test data for further acknowledgement of our results, after passing it through different proposed models. Finally, we show that the MCAAM-Net model achieved an accuracy of 99.60% on UrbanSound8K dataset, which outperforms the benchmark models like TSCNN-DS, ADCNN-5, ESResNet-Attention, and so on that are introduced over the recent years. © 2021 John Wiley & Sons Ltd.
Author Keywords additive angular margin; angular softmax; chromagram; large margin cosine; mel frequency cepstral coefficient; smart city


Similar Articles


Id Similarity Authors Title Published
14555 View0.923Seker H.; Inik O.Cnnsound: Convolutional Neural Networks For The Classification Of Environmental SoundsACM International Conference Proceeding Series (2020)
24260 View0.907Seresht H.R.; Mohammadi K.Environmental Sound Classification With Low-Complexity Convolutional Neural Network Empowered By Sparse Salient Region PoolingIEEE Access, 11 (2023)
14305 View0.905Agarwal M.; Gill K.S.; Chattopadhyay S.; Singh M.Classification Of Urban Sound Using Sequential Convolutional Neural Network (Cnn) Model And Its Visualisation2024 IEEE International Conference on Information Technology, Electronics and Intelligent Communication Systems, ICITEICS 2024 (2024)
33493 View0.896Özseven T.Investigation Of The Effectiveness Of Time-Frequency Domain Images And Acoustic Features In Urban Sound ClassificationApplied Acoustics, 211 (2023)
7700 View0.896Zhang D.; Zhong Z.; Xia Y.; Wang Z.; Xiong W.An Automatic Classification System For Environmental Sound In Smart CitiesSensors, 23, 15 (2023)
14284 View0.896Reddy B.S.; Chowdary D.M.; Srinivas R.; Rahmani M.O.Classification Of Environmental And Urban Sounds Using Deep Learning Techniques4th IEEE International Conference on Distributed Computing and Electrical Circuits and Electronics, ICDCECE 2025 (2025)
14548 View0.892İnik Ö.Cnn Hyper-Parameter Optimization For Environmental Sound ClassificationApplied Acoustics, 202 (2023)
60186 View0.891Agarwal M.; Gill K.S.; Aggarwal P.; Rawat R.S.; Sunil G.Urban Sound Classification Using Vgg19 Convolutional Neural Network (Cnn) Model And Its Visualisation4th International Conference on Innovative Practices in Technology and Management 2024, ICIPTM 2024 (2024)
58279 View0.888Vijay M.; Ruthwik Saran K.; Reddy K.R.; Aditya Ram K.; Babu J.Y.Towards Robust Environmental Sound Classification: A Deep Learning Approach Leveraging Time-Frequency Representations2nd International Conference on Emerging Research in Computational Science, ICERCS 2024 (2024)
52335 View0.887Chatterjee R.; Bishwas P.; Chakrabarty S.; Bandyopadhyay T.South Asian Sounds: Audio ClassificationProceedings - 2024 4th International Conference on Computer, Communication, Control and Information Technology, C3IT 2024 (2024)