Smart City Gnosys

Smart city article details

Title	Improving Fast Adaptation For Newcomers In Multi-Robot Reinforcement Learning System
ID_Doc	30847
Authors	Li Y.; Zhou W.; Wang H.; Ding B.; Xu K.
Year	2019
Published	Proceedings - 2019 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Internet of People and Smart City Innovation, SmartWorld/UIC/ATC/SCALCOM/IOP/SCI 2019
DOI	http://dx.doi.org/10.1109/SmartWorld-UIC-ATC-SCALCOM-IOP-SCI.2019.00162
Abstract	Multi-robot system has been adopted as a kind of ubiquitous intelligent systems to perform critical tasks in various fields. In multi-robot systems, multi-agent reinforcement learning (MARL) is regarded as a promising technology to support decision-making. However, existing MARL approaches assume either a predefined system configuration or a unified model for agents with identical roles, and thus cannot effectively deal with the dynamic change in the number of robots, which is very common in the real world. This kind of 'adaptation' problem seriously hinders the development of intelligence in multi-robot systems. In this paper, we propose a novel meta-MADDPG approach to enable new robots to integrate into an existing multi-robot system quickly. We build on the MADDPG (Multi-Agent Deep Deterministic Policy Gradient) algorithm and distill the meta-knowledge of a specific robot team by training a meta-actor and a meta-critic simultaneously. The meta-actor can learn an experienced policy net for new robots to perform reasonable actions directly if the situation is urgent, while the meta-critic trains a value net to criticize the current situation for better evolution of new robots. Our experiments on a typical application case (multi-robot collision avoidance) indicate that the meta-knowledge can significantly improve the fast adaptation for the newcomers. Our source code is available at https://github.com/liyiying/meta-MADDPG. © 2019 IEEE.
Author Keywords	Deep reinforcement learning; Fast adaptation; Meta-learning; Multi-robot system; Ubiquitous intelligence

Similar Articles

Id	Similarity	Authors	Title	Published
26116	0.93	Jia H.; DIng B.; Wang H.; Gong X.; Zhou X.	Fast Adaptation Via Meta Learning In Multi-Agent Cooperative Tasks	Proceedings - 2019 IEEE SmartWorld, Ubiquitous Intelligence and Computing, Advanced and Trusted Computing, Scalable Computing and Communications, Internet of People and Smart City Innovation, SmartWorld/UIC/ATC/SCALCOM/IOP/SCI 2019 (2019)
4174	0.867	Chen Z.; Liu Z.; Wan L.; Chen X.; Zhu Y.; Wang C.; Cheng X.; Zhang Y.; Zhang S.; Wang X.; Lan X.	A Review Of Multi-Agent Reinforcement Learning Theory And Applications; [多智能体强化学习理论及其应用综述]	Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 37, 10 (2024)