Reinforcement Learning-based Fault-tolerant Attitude Control of Spacecraft under Actuator Failures

Fuxiang Liu*, Yupeng Liang

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A fault-tolerant control method based on the Deep Deterministic Policy Gradient (DDPG) algorithm is proposed for spacecraft attitude control systems in the presence of actuator faults and external disturbances. Initially, a spacecraft dynamics model under actuator faults is established, and a finite-time sliding mode controller is designed to ensure that the sliding surface converges to zero within a finite time. Subsequently, to address the sensitivity of the finite-time sliding mode controller's performance to its parameters, reinforcement learning is employed to optimize the controller parameters. Finally, simulation experiments are conducted. The results demonstrate that the proposed reinforcement learning-based controller can achieve convergence of both the attitude quaternion and the angular velocity within a finite time.

Original languageEnglish
Title of host publicationProceedings of the 2025 International Conference on Intelligent Systems, Automation and Control, ISAC 2025
PublisherAssociation for Computing Machinery, Inc
Pages123-127
Number of pages5
ISBN (Electronic)9798400715198
DOIs
Publication statusPublished - 8 Jun 2025
Externally publishedYes
Event2025 International Conference on Intelligent Systems, Automation and Control, ISAC 2025 - Xi'an, China
Duration: 28 Mar 202530 Mar 2025

Publication series

NameProceedings of the 2025 International Conference on Intelligent Systems, Automation and Control, ISAC 2025

Conference

Conference2025 International Conference on Intelligent Systems, Automation and Control, ISAC 2025
Country/TerritoryChina
CityXi'an
Period28/03/2530/03/25

Keywords

  • Actuator failures
  • Fault-tolerant control
  • Reinforcement learning
  • Spacecraft

Fingerprint

Dive into the research topics of 'Reinforcement Learning-based Fault-tolerant Attitude Control of Spacecraft under Actuator Failures'. Together they form a unique fingerprint.

Cite this