Smart City Gnosys

Smart city article details

Title Efficient DNN Framework For Multimodal Educational Fusion: Implementation In Talent Cultivation
ID_Doc 22292
Authors Yu H.
Year 2025
Published Proceedings of SPIE - The International Society for Optical Engineering, 13682
DOI http://dx.doi.org/10.1117/12.3075587
Abstract This study introduces an efficient deep neural network (DNN) framework for multimodal data fusion, targeting the integration of text, video, behavioral logs, and optical sensor data in smart education systems and talent cultivation platforms. The framework incorporates a dynamic feature selection module to prioritize critical multimodal features while reducing redundancy, alongside a hybrid compression pipeline that synergizes pruning and quantization to achieve a 62% reduction in floating-point operations (FLOPs) without compromising accuracy. A cross-modal contrastive alignment mechanism is further employed to bridge semantic gaps between heterogeneous modalities, leading to an F1-score of 91.5%. Incorporating optical sensing data enhances the detection of engagement levels, such as attention span and focus, which are critical for adaptive learning in smart cities. © COPYRIGHT SPIE. Downloading of the abstract is permitted for personal use only.
Author Keywords Cross Modal Alignment; Deep Neural Networks; Dynamic Feature Selection; Educational Systems; Model Compression; Multimodal Data Fusion
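
The abstract names a cross-modal contrastive alignment mechanism but does not give its loss function. As a rough illustration only, the sketch below uses a symmetric InfoNCE-style loss, a common choice for aligning paired embeddings from two modalities. This is an assumption, not the paper's confirmed method. The function names, the `temperature` value, and the toy embeddings are all hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are left unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def log_softmax_at(row, idx):
    """Log-probability of entry idx under a softmax over row (numerically stable)."""
    m = max(row)
    log_sum_exp = m + math.log(sum(math.exp(x - m) for x in row))
    return row[idx] - log_sum_exp


def contrastive_alignment_loss(emb_a, emb_b, temperature=0.1):
    """Symmetric InfoNCE-style loss (assumed formulation, not from the paper).

    emb_a[i] and emb_b[i] are embeddings of the same sample from two
    modalities (for example text and video); all other pairs are negatives.
    """
    a = [l2_normalize(v) for v in emb_a]
    b = [l2_normalize(v) for v in emb_b]
    n = len(a)
    # Cosine similarity matrix scaled by the temperature.
    sim = [
        [sum(x * y for x, y in zip(a[i], b[j])) / temperature for j in range(n)]
        for i in range(n)
    ]
    # Direction A -> B: each row should peak on its diagonal entry.
    loss_ab = -sum(log_softmax_at(sim[i], i) for i in range(n)) / n
    # Direction B -> A: same check on the transposed matrix.
    sim_t = [list(col) for col in zip(*sim)]
    loss_ba = -sum(log_softmax_at(sim_t[i], i) for i in range(n)) / n
    return 0.5 * (loss_ab + loss_ba)


# Toy 2-D embeddings: matched pairs versus deliberately swapped pairs.
text_emb = [[1.0, 0.0], [0.0, 1.0]]
video_matched = [[1.0, 0.0], [0.0, 1.0]]
video_swapped = [[0.0, 1.0], [1.0, 0.0]]

loss_matched = contrastive_alignment_loss(text_emb, video_matched)
loss_swapped = contrastive_alignment_loss(text_emb, video_swapped)
```

With these toy inputs the matched pairs give a loss near zero and the swapped pairs give a much larger one, which is the behavior an alignment objective of this kind is meant to reward.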


Similar Articles


Id | Similarity | Authors | Title | Published
33986 | 0.869 | Xu B. | IoT-Based Multimodal Learning Framework For Predicting Student Engagement In English Education | Proceedings of SPIE - The International Society for Optical Engineering, 13682 (2025)