Smart City Gnosys

Smart city article details

Title Multi-Agent Meta Reinforcement Learning For Reliable And Low-Latency Distributed Inference In Resource-Constrained Uav Swarms
ID_Doc 38095
Authors Dhuheir M.; Erbad A.; Hamdaoui B.; Belhaouari S.B.; Guizani M.; Vu T.X.
Year 2025
Published IEEE Access, 13
DOI http://dx.doi.org/10.1109/ACCESS.2025.3572036
Abstract The integration of unmanned aerial vehicles (UAVs) in the Industrial Internet of Things (IIoT) for smart city applications has been gaining significant attention. UAV swarms are increasingly employed to monitor ground-based IIoT devices in smart cities, offering valuable support to situation-awareness IoT applications, such as surveillance, traffic management, and emergency response. A key requirement in these applications is minimizing the latency of data processing, particularly for time-sensitive tasks like image classification of IIoT device data. Due to resource limitations, UAVs often rely on online task offloading to remote machines, but this can be inefficient due to unstable connections, constrained resources, and high latency. Distributed inference enabled via swarms of collaborative UAVs presents a promising solution by partitioning tasks among UAVs based on their available resources, allowing for more efficient, collaborative processing. However, the IIoT inference distribution raises challenges in ensuring reliable data transmission with minimal latency while respecting the practical UAVs’ constraints. To address these issues, we formulate the problem of CNN layer distribution and UAV trajectory planning (LDTP) as an optimization problem to improve latency, reliability, and resource usage. Given the complexity of the LDTP solution for managing online requests, we propose a real-time, lightweight solution using multi-agent meta-reinforcement learning. Our approach is tested on CNN networks and benchmarked against state-of-the-art conventional reinforcement learning algorithms. Extensive simulations show that our model outperforms competitive methods by around 29% in terms of latency and around 23% in terms of transmission power improvements while delivering results comparable to the traditional LDTP optimization solution by around 9% in terms of latency. © 2013 IEEE.
Author Keywords distributed resource optimization; energy harvesting; Industrial Internet of Things; Meta-reinforcement learning; UAV swarms


Similar Articles


Id Similarity Authors Title Published
6281 View0.883LI Y.; QU Y.; DONG C.; QIN Z.; ZHANG L.; WU Q.Adaptive Model Switching Of Collaborative Inference For Multi-Cnn Streams In Uav SwarmChinese Journal of Aeronautics, 38, 8 (2025)
16204 View0.882Qin C.; Pournaras E.Coordination Of Drones At Scale: Decentralized Energy-Aware Swarm Intelligence For Spatio-Temporal SensingTransportation Research Part C: Emerging Technologies, 157 (2023)
16153 View0.88Yun W.J.; Park S.; Kim J.; Shin M.; Jung S.; Mohaisen D.A.; Kim J.-H.Cooperative Multiagent Deep Reinforcement Learning For Reliable Surveillance Via Autonomous Multi-Uav ControlIEEE Transactions on Industrial Informatics, 18, 10 (2022)
23748 View0.874Gomes D.; Hasan M.; Philip S.R.Enhancing Capabilities And Security Features Of Drone Networks Through Machine Learning: A Comprehensive OverviewAdvances in Science, Technology and Innovation, Part F372 (2025)
22476 View0.874Qu Y.; Sun H.; Dong C.; Kang J.; Dai H.; Wu Q.; Guo S.Elastic Collaborative Edge Intelligence For Uav Swarm: Architecture, Challenges, And OpportunitiesIEEE Communications Magazine, 62, 1 (2024)
2337 View0.874Xi M.; Dai H.; He J.; Li W.; Wen J.; Xiao S.; Yang J.A Lightweight Reinforcement-Learning-Based Real-Time Path-Planning Method For Unmanned Aerial VehiclesIEEE Internet of Things Journal, 11, 12 (2024)
38114 View0.872Neelamegam G.; Venkatesan R.; Ramya S.R.; Ramya R.S.; Akshya J.; Sundarrajan M.; Choudhry M.D.Multi-Agent Systems For Autonomous Iot Network Management Using Distributed Reinforcement LearningProceedings of 2025 3rd International Conference on Intelligent Systems, Advanced Computing, and Communication, ISACC 2025 (2025)
8277 View0.872Li C.; Feng Q.; Ding C.; Ye Z.An Improved Multi-Actor Hybrid Attention Critic Algorithm For Cooperative Navigation In Urban Low-Altitude Logistics EnvironmentsComputers, Materials and Continua, 84, 2 (2025)
35845 View0.871Qin C.; Robins A.; Lillywhite-Roake C.; Pearce A.; Mehta H.; James S.; Wong T.H.; Pournaras E.M-Set: Multi-Drone Swarm Intelligence Experimentation With Collision Avoidance RealismProceedings - Conference on Local Computer Networks, LCN (2024)
1129 View0.867Liu X.; Wang Y.; Gao H.; Ngai E.C.H.; Zhang B.; Wang C.; Wang W.A Coverage-Aware Task Allocation Method For Uav-Assisted Mobile Crowd SensingIEEE Transactions on Vehicular Technology, 73, 7 (2024)