TY - GEN
T1 - Cooperative Persistent Surveillance with a Multi-Ugv System based on Reinforcement Learning
AU - Li, Guangzheng
AU - Li, Zhuo
AU - Wang, Gang
AU - Wu, Chuge
AU - Wang, Jingjing
AU - Sun, Jian
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - This paper investigates a persistent surveillance problem using a group of unmanned ground vehicles (UGVs) in a cooperative manner. The primary objective is to achieve continuous and frequent coverage of the entire target area through cooperation of the multi-UGV system. To this end, we model the persistent surveillance problem as a decentralized partially observable Markov decision process, where a knowability map is introduced for the target area and employed in the design of reward functions. Due to the limited sensing range of each UGV, the knowability map cannot be directly available. Thus, a consensus-based estimation method is designed for each UGV for estimation, and the issue of partial observability is resolved by fully exploiting observations from neighboring UGVs. Furthermore, we propose a deep reinforcement learningbased algorithm with the architecture of centralized training and distributed execution, which derives efficient cooperative surveillance policies for the UGVs. Extensive simulations demonstrate the effectiveness and robustness of the proposed algorithm for the persistent surveillance.
AB - This paper investigates a persistent surveillance problem using a group of unmanned ground vehicles (UGVs) in a cooperative manner. The primary objective is to achieve continuous and frequent coverage of the entire target area through cooperation of the multi-UGV system. To this end, we model the persistent surveillance problem as a decentralized partially observable Markov decision process, where a knowability map is introduced for the target area and employed in the design of reward functions. Due to the limited sensing range of each UGV, the knowability map cannot be directly available. Thus, a consensus-based estimation method is designed for each UGV for estimation, and the issue of partial observability is resolved by fully exploiting observations from neighboring UGVs. Furthermore, we propose a deep reinforcement learningbased algorithm with the architecture of centralized training and distributed execution, which derives efficient cooperative surveillance policies for the UGVs. Extensive simulations demonstrate the effectiveness and robustness of the proposed algorithm for the persistent surveillance.
UR - http://www.scopus.com/pages/publications/105016182319
U2 - 10.1109/ICCA65672.2025.11129787
DO - 10.1109/ICCA65672.2025.11129787
M3 - Conference contribution
AN - SCOPUS:105016182319
T3 - IEEE International Conference on Control and Automation, ICCA
SP - 781
EP - 786
BT - 2025 IEEE 19th International Conference on Control and Automation, ICCA 2025
PB - IEEE Computer Society
T2 - 19th IEEE International Conference on Control and Automation, ICCA 2025
Y2 - 30 June 2025 through 3 July 2025
ER -