Smart City Gnosys

Smart city article details

Title IoT-Based Multimodal Learning Framework for Predicting Student Engagement in English Education
ID_Doc 33986
Authors Xu B.
Year 2025
Published Proceedings of SPIE - The International Society for Optical Engineering, 13682
DOI http://dx.doi.org/10.1117/12.3075528
Abstract With the rapid development of the Internet of Things (IoT) and Artificial Intelligence (AI), multimodal learning has become an important part of intelligent systems, improving decision-making by integrating and analyzing multiple data sources. These technologies have the potential to transform education, especially in smart cities, by monitoring and optimizing how students engage. This study introduces a Transformer-based multimodal learning framework that integrates IoT devices, AI, and smart city infrastructure, and uses optical sensing technology to predict student engagement in English language learning. Optical IoT sensors collect real-time, high-fidelity multimodal data, including voice signals, physiological indicators, and facial expressions. To ensure accuracy and reliability in dynamic environments, the data are preprocessed and synchronized. Within the Transformer architecture, a dynamic attention mechanism adjusts to the priority of each modality's features, which helps address heterogeneous data fusion, noise immunity, and computational efficiency. Experimental results from real classroom environments in smart cities show that the framework achieves a prediction accuracy of 91.2% with an F1 score of 0.90, outperforming traditional LSTM- and CNN-based models. In addition, model compression techniques such as quantization and weight pruning improve computational performance and real-time processing, yielding an average inference time of 10 ms. These findings suggest that the framework can enhance intelligent educational systems through accurate and efficient engagement prediction. The proposed strategies show great potential for improving personalized learning and adaptive teaching, and they provide scalable, practical solutions for real-world applications in smart cities.
© COPYRIGHT SPIE. Downloading of the abstract is permitted for personal use only.
Author Keywords IOT sensors; multimodal learning; Optical Sensing; smart cities; transformer architecture
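
Illustrative sketch (not from the paper): the abstract describes an attention mechanism that weights modalities by priority but gives no architectural detail. The Python below is a minimal stand-in that assumes each modality has already been reduced to a fixed-length feature vector and that a scalar priority score per modality is available. The function names, the modality names, and the use of a plain softmax over scores are all hypothetical simplifications, not the author's Transformer implementation.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_modalities(features, priority_scores):
    """Fuse per-modality vectors with softmax-normalized priority weights.

    features: dict of modality name -> list of floats (same length each).
    priority_scores: dict of modality name -> float (hypothetical logits).
    Returns (weights by modality, fused vector).
    """
    names = sorted(features)
    dims = {len(features[name]) for name in names}
    if len(dims) != 1:
        raise ValueError("all modality vectors must share one dimension")
    weights = softmax([priority_scores[name] for name in names])
    fused = [0.0] * dims.pop()
    for name, weight in zip(names, weights):
        for i, value in enumerate(features[name]):
            fused[i] += weight * value
    return dict(zip(names, weights)), fused


# Hypothetical toy inputs: three modalities, two features each.
toy_features = {
    "face": [1.0, 0.0],
    "voice": [0.0, 1.0],
    "physio": [1.0, 1.0],
}
toy_scores = {"face": 0.0, "voice": 2.0, "physio": 0.0}
weights, fused = fuse_modalities(toy_features, toy_scores)
print(weights, fused)
```

In this sketch a higher score shifts weight toward that modality, while equal scores reduce the fusion to a plain average. A real Transformer would learn such scores from the data rather than take them as inputs.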


Similar Articles


Id | Similarity | Authors | Title | Published
22292 | 0.869 | Yu H. | Efficient DNN Framework For Multimodal Educational Fusion: Implementation In Talent Cultivation | Proceedings of SPIE - The International Society for Optical Engineering, 13682 (2025)
7385 | 0.867 | Embarak O. | An Adaptive Paradigm For Smart Education Systems In Smart Cities Using The Internet Of Behaviour (IoB) And Explainable Artificial Intelligence (XAI) | 8th International Conference on Information Technology Trends: Industry 4.0: Technology Trends and Solutions, ITT 2022 (2022)
33739 | 0.865 | Setiawan R.; Devadass M.M.V.; Rajan R.; Sharma D.K.; Singh N.P.; Amarendra K.; Ganga R.K.R.; Manoharan R.R.; Subramaniyaswamy V.; Sengan S. | IoT Based Virtual E-Learning System For Sustainable Development Of Smart Cities | Journal of Grid Computing, 20, 3 (2022)
32848 | 0.853 | Embarak O.H. | Internet Of Behaviour (IoB)-Based AI Models For Personalized Smart Education Systems | Procedia Computer Science, 203 (2022)
2102 | 0.852 | Cao H.; Wachowicz M. | A Holistic Overview Of Anticipatory Learning For The Internet Of Moving Things: Research Challenges And Opportunities | ISPRS International Journal of Geo-Information, 9, 4 (2020)