Smart City Gnosys

Smart city article details

Title Graph-Enhanced Dual-Stream Feature Fusion With Pre-Trained Model For Acoustic Traffic Monitoring
ID_Doc 28293
Authors Fan S.; Xiao F.; Wang W.; Qi S.; Zhu Q.; Wang W.; Guan J.
Year 2025
Published ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
DOI http://dx.doi.org/10.1109/ICASSP49660.2025.10889208
Abstract Microphone array techniques are widely used in sound source localization and smart city acoustic-based traffic monitoring, but these applications face significant challenges due to the scarcity of labeled real-world traffic audio data and the complexity and diversity of application scenarios. The DCASE Challenge's Task 10 focuses on using multi-channel audio signals to count vehicles (cars or commercial vehicles) and identify their directions (left-to-right or vice versa). In this paper, we propose a graph-enhanced dual-stream feature fusion network (GEDF-Net) for acoustic traffic monitoring, which simultaneously considers vehicle type and direction to improve detection. We propose a graph-enhanced dual-stream feature fusion strategy which consists of a vehicle type feature extraction (VTFE) branch, a vehicle direction feature extraction (VDFE) branch, and a frame-level feature fusion module to combine the type and direction feature for enhanced performance. A pre-trained model (PANNs) is used in the VTFE branch to mitigate data scarcity and enhance the type features, followed by a graph attention mechanism to exploit temporal relationships and highlight important audio events within these features. The frame-level fusion of direction and type features enables fine-grained feature representation, resulting in better detection performance. Experiments demonstrate the effectiveness of our proposed method. GEDF-Net is our submission that achieved 1st place in the DCASE 2024 Challenge Task 10. © 2025 IEEE.
Author Keywords Acoustic-based traffic monitoring; feature fusion; graph attention; pre-trained model; transfer learning


Similar Articles


Id Similarity Authors Title Published
50556 View0.867Shabbir A.; Cheema A.N.; Ullah I.; Almanjahie I.M.; Alshahrani F.Smart City Traffic Management: Acoustic-Based Vehicle Detection Using Stacking-Based Ensemble Deep Learning ApproachIEEE Access, 12 (2024)
44289 View0.854Saradopoulos I.; Potamitis I.; Ntalampiras S.; Rigakis I.; Manifavas C.; Konstantaras A.Real-Time Acoustic Detection Of Critical Incidents In Smart Cities Using Artificial Intelligence And Edge NetworksSensors, 25, 8 (2025)
17123 View0.851Min R.; Chen Y.; Wang H.; Chen Y.Das Vehicle Signal Extraction Using Machine Learning In Urban Traffic MonitoringIEEE Transactions on Geoscience and Remote Sensing, 62 (2024)