TD3-Based Model Predictive Control for Satellite Formation-KeepingSource: Journal of Aerospace Engineering:;2024:;Volume ( 037 ):;issue: 006::page 04024077-1DOI: 10.1061/JAEEEZ.ASENG-5646Publisher: American Society of Civil Engineers
Abstract: The escalating prevalence of formation flights in space missions has led researchers to intensify their focus on designing optimal control systems for satellite formation motion along reference orbits, with the aim of reducing tracking error and energy consumption. However, conventional controllers typically excel at optimizing only one of these objectives, and the manual parameter tuning of such controllers proves to be a challenging task. In this paper, we introduce a novel approach, the twin delayed deep deterministic policy gradient-based model predictive control (TD3-MPC) method. To tackle the multiobjective formation-keeping challenge, a linear model predictive controller based on the satellite’s dynamics had been developed. Subsequently, a cost function is formulated to facilitate the optimization of multiple objectives, specifically tracking error and fuel consumption. In addressing the intricate issue of controller parameter tuning, we employ reinforcement learning and design a reward function reflective of the TD3 algorithm’s controller performance. Simulation results underscore the superior performance of the proposed TD3-MPC algorithm compared to the linear model predictive controller, achieving a notable 27.83% reduction in tracking error and a substantial 48.30% decrease in fuel consumption under large error condition and 3.67% reduction in tracking error and a substantial 22.27% decrease in fuel consumption under small error condition. By effectively combining the strengths of reinforcement learning and model predictive control, TD3-MPC enhances the satellite’s ability to adhere more precisely to its intended trajectory, thereby ensuring the stability and desired operational performance of the satellite formation.
|
Collections
Show full item record
contributor author | Xing Hu | |
contributor author | Zhi Zhai | |
contributor author | Jinxin Liu | |
contributor author | Chenxi Wang | |
contributor author | Naijin Liu | |
contributor author | Xuefeng Chen | |
date accessioned | 2024-12-24T10:15:23Z | |
date available | 2024-12-24T10:15:23Z | |
date copyright | 11/1/2024 12:00:00 AM | |
date issued | 2024 | |
identifier other | JAEEEZ.ASENG-5646.pdf | |
identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4298581 | |
description abstract | The escalating prevalence of formation flights in space missions has led researchers to intensify their focus on designing optimal control systems for satellite formation motion along reference orbits, with the aim of reducing tracking error and energy consumption. However, conventional controllers typically excel at optimizing only one of these objectives, and the manual parameter tuning of such controllers proves to be a challenging task. In this paper, we introduce a novel approach, the twin delayed deep deterministic policy gradient-based model predictive control (TD3-MPC) method. To tackle the multiobjective formation-keeping challenge, a linear model predictive controller based on the satellite’s dynamics had been developed. Subsequently, a cost function is formulated to facilitate the optimization of multiple objectives, specifically tracking error and fuel consumption. In addressing the intricate issue of controller parameter tuning, we employ reinforcement learning and design a reward function reflective of the TD3 algorithm’s controller performance. Simulation results underscore the superior performance of the proposed TD3-MPC algorithm compared to the linear model predictive controller, achieving a notable 27.83% reduction in tracking error and a substantial 48.30% decrease in fuel consumption under large error condition and 3.67% reduction in tracking error and a substantial 22.27% decrease in fuel consumption under small error condition. By effectively combining the strengths of reinforcement learning and model predictive control, TD3-MPC enhances the satellite’s ability to adhere more precisely to its intended trajectory, thereby ensuring the stability and desired operational performance of the satellite formation. | |
publisher | American Society of Civil Engineers | |
title | TD3-Based Model Predictive Control for Satellite Formation-Keeping | |
type | Journal Article | |
journal volume | 37 | |
journal issue | 6 | |
journal title | Journal of Aerospace Engineering | |
identifier doi | 10.1061/JAEEEZ.ASENG-5646 | |
journal fristpage | 04024077-1 | |
journal lastpage | 04024077-18 | |
page | 18 | |
tree | Journal of Aerospace Engineering:;2024:;Volume ( 037 ):;issue: 006 | |
contenttype | Fulltext |