Smart City Gnosys

Smart city article details

Title Improving Fast Adaptation For Newcomers In Multi-Robot Reinforcement Learning System
ID_Doc 30847
Authors Li Y.; Zhou W.; Wang H.; Ding B.; Xu K.
Year 2019
Published Proceedings - 2019 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Internet of People and Smart City Innovation, SmartWorld/UIC/ATC/SCALCOM/IOP/SCI 2019
DOI http://dx.doi.org/10.1109/SmartWorld-UIC-ATC-SCALCOM-IOP-SCI.2019.00162
Abstract Multi-robot systems have been adopted as a kind of ubiquitous intelligent system to perform critical tasks in various fields. In multi-robot systems, multi-agent reinforcement learning (MARL) is regarded as a promising technology to support decision-making. However, existing MARL approaches assume either a predefined system configuration or a unified model for agents with identical roles, and thus cannot effectively deal with dynamic changes in the number of robots, which are very common in the real world. This kind of 'adaptation' problem seriously hinders the development of intelligence in multi-robot systems. In this paper, we propose a novel meta-MADDPG approach to enable new robots to integrate into an existing multi-robot system quickly. We build on the MADDPG (Multi-Agent Deep Deterministic Policy Gradient) algorithm and distill the meta-knowledge of a specific robot team by training a meta-actor and a meta-critic simultaneously. The meta-actor can learn an experienced policy net for new robots to perform reasonable actions directly if the situation is urgent, while the meta-critic trains a value net to criticize the current situation for better evolution of new robots. Our experiments on a typical application case (multi-robot collision avoidance) indicate that the meta-knowledge can significantly improve fast adaptation for newcomers. Our source code is available at https://github.com/liyiying/meta-MADDPG. © 2019 IEEE.
Author Keywords Deep reinforcement learning; Fast adaptation; Meta-learning; Multi-robot system; Ubiquitous intelligence


Similar Articles


Id 26116
Similarity 0.93
Authors Jia H.; Ding B.; Wang H.; Gong X.; Zhou X.
Title Fast Adaptation Via Meta Learning In Multi-Agent Cooperative Tasks
Published Proceedings - 2019 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Internet of People and Smart City Innovation, SmartWorld/UIC/ATC/SCALCOM/IOP/SCI 2019 (2019)

Id 4174
Similarity 0.867
Authors Chen Z.; Liu Z.; Wan L.; Chen X.; Zhu Y.; Wang C.; Cheng X.; Zhang Y.; Zhang S.; Wang X.; Lan X.
Title A Review Of Multi-Agent Reinforcement Learning Theory And Applications; [多智能体强化学习理论及其应用综述]
Published Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 37, 10 (2024)