Journal of System Simulation

Research on Autonomous Decision-making in Air-combat Based on Improved Proximal Policy Optimization

Dianwei Qian, School of Control and Computer Engineering, North China Electric Power University, Beijing 102206, China
Hongmin Qi, School of Control and Computer Engineering, North China Electric Power University, Beijing 102206, China
Zhen Liu, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
Zhiming Zho, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
Jianqiang Yi, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China

Abstract

Abstract: To address the problems of high information redundancy and slow convergence speed of traditional reinforcement learning in air-combat autonomous decision-making applications, a proximal policy optimization air-combat autonomous decision-making method, based on dual observation and composite reward is proposed. A dual observation space, which contains interaction information as the main information and individual feature information as a supplement, was designed to reduce the influence of redundant battlefield information on the training efficiency of the decision model. A composite reward function combining result reward and process reward was designed to improve convergence speed. The generalized advantage estimator was applied in the proximal policy optimization strategy algorithm to improve the accuracy of advantage function estimation. Simulation results show that the method decision-making model can make precise autonomous decisions and complete air-combat tasks according to the battlefield situation in two types of experimental scenarios: against fixedprogrammed and matrix gaming opponents.

Recommended Citation

Qian, Dianwei; Qi, Hongmin; Liu, Zhen; Zho, Zhiming; and Yi, Jianqiang (2024) "Research on Autonomous Decision-making in Air-combat Based on Improved Proximal Policy Optimization," Journal of System Simulation: Vol. 36: Iss. 9, Article 20.
DOI: 10.16182/j.issn1004731x.joss.23-0584
Available at: https://dc-china-simulation.researchcommons.org/journal/vol36/iss9/20

First Page

2208

Last Page

2218

CLC

TP391.9

Recommended Citation

Qian Dianwei, Qi Hongmin, Liu Zhen, et al. Research on Autonomous Decision-making in Air-combat Based on Improved Proximal Policy Optimization[J]. Journal of System Simulation, 2024, 36(9): 2208-2218.

Corresponding Author

Zhou Zhiming

DOI

10.16182/j.issn1004731x.joss.23-0584

Download

Included in

Artificial Intelligence and Robotics Commons, Computer Engineering Commons, Numerical Analysis and Scientific Computing Commons, Operations Research, Systems Engineering and Industrial Engineering Commons, Systems Science Commons

COinS

Journal of System Simulation

Research on Autonomous Decision-making in Air-combat Based on Improved Proximal Policy Optimization

Authors

Abstract

Recommended Citation

First Page

Last Page

CLC

Recommended Citation

Corresponding Author

DOI

Included in

Share

Search