Smart City Gnosys

Smart city article details

Title Point And Voxel Cross Perception With Lightweight Cosformer For Large-Scale Point Cloud Semantic Segmentation
ID_Doc 42261
Authors Zhang S.; Wang B.; Chen Y.; Zhang S.; Zhang W.
Year 2024
Published International Journal of Applied Earth Observation and Geoinformation, 131
DOI http://dx.doi.org/10.1016/j.jag.2024.103951
Abstract Semantic segmentation of large-scale point clouds is crucial for advancing smart city infrastructure and supporting autonomous driving technology. However, existing semantic segmentation techniques designed for indoor environments often struggle to adapt to vast outdoor scenes. Moreover, networks for large-scale scenes face challenges such as limited receptive fields and computational complexity, hindering their ability to accurately perceive small target features. To address these challenges, we propose PVCFormer, a novel cross-attention architecture that leverages both point and voxel representations. By feeding concurrently sampled data at varying voxel resolutions into the network, PVCFormer enhances the segmentation of small-scale features while expanding the receptive field. Additionally, the cross-transformer block facilitates better fusion of point and voxel features, and the introduction of CosFormer improves the computational efficiency of the network.Simultaneously, we introduce SYSU9, a new dataset labeled with 9 categories covering an area of over 7 square kilometers, to serve as a benchmark for evaluating point cloud semantic segmentation algorithms. We proposed two model versions, PVCFormer-CA and PVCFormer-SA. PVCFormer-CA achieves an overall accuracy of 92.4 % on SensatUrban, 94.6 % on DALES, and 91.1 % on SYSU9. For semantic segmentation, PVCFormer-CA achieves 61.5 % mIoU on SensatUrban, 73.6 % mIoU on DALES, and 62.4 % mIoU on SYSU9.Our experiments demonstrated promising results in large-scale outdoor point cloud semantic segmentation and introduce novel methodologies leveraging attention mechanisms for handling large-scale point clouds. © 2024
Author Keywords CosFormer; Cross attention; Multi-scale integration; Outdoor scene semantic segmentation


Similar Articles


Id Similarity Authors Title Published
18322 View0.89Luo Z.; Zeng Z.; Tang W.; Wan J.; Xie Z.; Xu Y.Dense Dual-Branch Cross Attention Network For Semantic Segmentation Of Large-Scale Point CloudsIEEE Transactions on Geoscience and Remote Sensing, 62 (2024)
34830 View0.877Li Y.; Ye Z.; Huang X.; HeLi Y.; Shuang F.Lcl_Fda: Local Context Learning And Full-Level Decoder Aggregation Network For Large-Scale Point Cloud Semantic SegmentationNeurocomputing, 621 (2025)
43636 View0.871Yi H.; Liu Y.; Wang M.Psnet: Patch-Based Self-Attention Network For 3D Point Cloud Semantic SegmentationRemote Sensing, 17, 12 (2025)
17074 View0.863Zaboli M.; Rastiveis H.; Hosseiny B.; Shokri D.; Sarasua W.A.; Homayouni S.D-Net: A Density-Based Convolutional Neural Network For Mobile Lidar Point Clouds Classification In Urban AreasRemote Sensing, 15, 9 (2023)
16869 View0.859Gao L.; Liu Y.; Chen X.; Liu Y.; Yan S.; Zhang M.Cus3D: A New Comprehensive Urban-Scale Semantic-Segmentation 3D Benchmark DatasetRemote Sensing, 16, 6 (2024)
18256 View0.852Li Z.; Ning X.; Lv Z.; Shi Z.; Jin H.; Wang Y.; Zhou W.Demf-Net: Dual-Branch Feature Enhancement And Multi-Scale Fusion For Semantic Segmentation Of Large-Scale Point Clouds; [Demf-Net:基于双分支增强和多尺度融合的大规模点云语义分割]Journal of Graphics, 46, 2 (2025)
48566 View0.852Li X.; Zhang Z.; Li Y.; Huang M.; Zhang J.Sfl-Net: Slight Filter Learning Network For Point Cloud Semantic SegmentationIEEE Transactions on Geoscience and Remote Sensing, 61 (2023)