Smart City Gnosys

Smart city article details

Title Scalable Neural Architectures For End-To-End Environmental Sound Classification
ID_Doc 47334
Authors Paissan F.; Ancilotto A.; Brutti A.; Farella E.
Year 2022
Published ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2022-May
DOI http://dx.doi.org/10.1109/ICASSP43922.2022.9746093
Abstract Sound Event Detection (SED) is a complex task simulating human ability to recognize what is happening in the surrounding from auditory signals only. This technology is a crucial asset in many applications such as smart cities. Here, urban sounds can be detected and processed by embedded devices in an Internet of Things (IoT) to identify meaningful events for municipalities or law enforcement. However, while current deep learning techniques for SED are effective, they are also resource- and power-hungry, thus not appropriate for pervasive battery-powered devices. In this paper, we propose novel neural architectures based on PhiNets for real-time acoustic event detection on microcontroller units. The proposed models are easily scalable to fit the hardware requirements and can operate both on spectrograms and waveforms. In particular, our architectures achieve state-of-the-art performance on UrbanSound8K in spectrogram classification (around 77%) with extreme compression factors (99.8%) with respect to current state-of-the-art architectures. © 2022 IEEE
Author Keywords IoT; scalable backbone; sound event detection; tinyML


Similar Articles


Id | Similarity | Authors | Title | Published
40858 | 0.949 | Brutti A.; Paissan F.; Ancilotto A.; Farella E. | Optimizing Phinet Architectures For The Detection Of Urban Sounds On Low-End Devices | European Signal Processing Conference, 2022-August (2022)
52320 | 0.913 | Cerutti G.; Andri R.; Cavigelli L.; Farella E.; Magno M.; Benini L. | Sound Event Detection With Binary Neural Networks On Tightly Power-Constrained Iot Devices | ACM International Conference Proceeding Series (2020)
24672 | 0.896 | Lamrini M.; Chkouri M.Y.; Touhafi A. | Evaluating The Performance Of Pre-Trained Convolutional Neural Network For Audio Classification On Embedded Systems For Anomaly Detection In Smart Cities | Sensors, 23, 13 (2023)
8076 | 0.875 | Mukhamadiyev A.; Khujayarov I.; Nabieva D.; Cho J. | An Ensemble Of Convolutional Neural Networks For Sound Event Detection | Mathematics, 13, 9 (2025)
52319 | 0.875 | Sammarco M.; Stellantis T.Z.; Gantert L.; Campista M.E.M. | Sound Event Detection Via Pervasive Devices For Mobility Surveillance In Smart Cities | 2024 IEEE International Conference on Pervasive Computing and Communications Workshops and other Affiliated Events, PerCom Workshops 2024 (2024)
52316 | 0.872 | Bello J.P.; Mydlarz C.; Salamon J. | Sound Analysis In Smart Cities | Computational Analysis of Sound Scenes and Events (2017)
52318 | 0.871 | Nogueira A.F.R.; Oliveira H.S.; Machado J.J.M.; Tavares J.M.R.S. | Sound Classification And Processing Of Urban Environments: A Systematic Literature Review | Sensors, 22, 22 (2022)
39492 | 0.87 | Hajihashemi V.; Alavigharahbagh A.; Machado J.J.M.; Tavares J.M.R.S. | Novel Sound Event And Sound Activity Detection Framework Based On Intrinsic Mode Functions And Deep Learning | Multimedia Tools and Applications, 84, 14 (2025)
44289 | 0.87 | Saradopoulos I.; Potamitis I.; Ntalampiras S.; Rigakis I.; Manifavas C.; Konstantaras A. | Real-Time Acoustic Detection Of Critical Incidents In Smart Cities Using Artificial Intelligence And Edge Networks | Sensors, 25, 8 (2025)
59656 | 0.868 | Hidayat A.; Njoo D.B.P.; Adrian G.D.; Setyoko D.E.; Wijanarko B.D. | Unlocking Soundscapes: Harnessing Machine Learning For Sound Classification | Proceeding of 2024 9th International Conference on Information Technology and Digital Applications, ICITDA 2024 (2024)