Smart City Gnosys
Smart city article details
| Title | Scaling Up Energy-Aware Multiagent Reinforcement Learning For Mission-Oriented Drone Networks With Individual Reward |
|---|---|
| ID_Doc | 47361 |
| Authors | Li C.; Li Y. |
| Year | 2025 |
| Published | IEEE Internet of Things Journal, 12, 8 |
| DOI | http://dx.doi.org/10.1109/JIOT.2024.3511253 |
| Abstract | Multiagent reinforcement learning (MARL) has shown wide applicability in collaborative systems, such as autonomous driving and smart cities for its ability of learning through interaction. With the recent development of drone networks, researchers have also applied MARL to address the trajectory planning problems. However, the dynamic environment and the limited battery capacity are still challenging for using MARL to achieve efficient collaborative task execution. In this article, we propose an energy-aware MARL model as an attempt to tackle these challenges, leveraging deep Q-networks (DQNs) with individual reward functions driven by the task execution progress and the remaining battery of drones. We conduct a set of simulation studies for the proposed mode and compare it with the shared reward MARL (Li et al., 2022) to explore the impact of credit assignment in MARL. The results indicate that our proposed model can achieve at least 80% success rate regardless of the task locations and lengths. Similar to the shared reward mode, the individual reward mode can achieve a better success rate when the task density is high, and it can hit nearly a 100% success rate when task density gets close to 40%. The true advantage of our proposed model with individual reward is revealed when scaling up the environment. The comparison to the shared reward MARL shows that the our proposed model is more robust toward the change of the environment size and agent numbers. It can achieve higher success rate with fewer steps due to the clarity of the goal which improves energy efficiency even better. © 2024 IEEE. |
| Author Keywords | Collaborative execution; deep Q-network (DQN); drone networks; mission-oriented; multiagent reinforcement learning (MARL) |
