|Table of Contents|

Research on Autonomous Landing Control of Carrier-borne UCAV Based on Deep Reinforcement Learning Technology(PDF)

南京师范大学学报(工程技术版)[ISSN:1006-6977/CN:61-1281/TN]

Issue:
2022年03期
Page:
63-71
Research Field:
计算机科学与技术
Publishing date:

Info

Title:
Research on Autonomous Landing Control of Carrier-borne UCAV Based on Deep Reinforcement Learning Technology
Author(s):
Huang JiangtaoLiu GangZhou PanZhang ShengDu Xin
(Aerospace Technology Institute,China Aerodynamics Research and Development Center,Mianyang 621000,China)
Keywords:
reinforcement learningcarrier-borne UAVintelligent carrier landingcontrol surface commanddeep neural network
PACS:
V211.3
DOI:
10.3969/j.issn.1672-1292.2022.03.009
Abstract:
Autonomous landing is an important problem and a key technology for future Carrier-borne UAV. Based on the TD3 algorithm,combined with the 6-DOF motion model of carrier aircraft and the motion model of aircraft carrier,an interactive deep reinforcement learning simulation environment is constructed. In the process of simulation training,the corresponding simplified motion model is established by considering the sea conditions,three line disturbances of aircraft carrier including surge,sway and heave,and three angular disturbances of roll,pitch and yaw. Based on the aerodynamic data of a certain type of aircraft,the aerodynamic model is established,and the six degree of freedom dynamics model is also established. Based on TD3 reinforcement learning algorithm,this paper further introduces an auxiliary network,an adaptive variance and learning step adjustment algorithm to accelerate convergence and improve training stability. Furthermore,combined with feed forward deep neural network technology,an interactive training environment for carrier based aircraft landing is established on high performance GPU workstation. Through the "trial and error" training of a certain type of Carrier-borne UAV in interactive environment,the feasibility of AI technology in Carrier-borne UAV autonomous landing control is verified.

References:

[1]安军. 航母尾流模拟及舰载机着舰控制的初步研究[D]. 武汉:华中科技大学,2012.
[2]巩鹏潇,詹浩,柳子栋. 舰尾流影响下的舰载机着舰控制与仿真研究[J]. 航空工程进展,2013,4(3):339-345,357.
[3]张孝伟. 飞翼舰载无人机着舰控制技术研究[D]. 南京:南京航空航天大学,2017.
[4]郑峰婴,龚华军,甄子洋. 基于坐标系动态变化的无人机着舰引导算法[J]. 中南大学学报(自然科学版),2016,47(8):2685-2693.
[5]ANDERSON M R. Inner and outer loop manual control of automatic carrier landing[C]//Proceedings of 1996 Guidance,Navigation,and Control Conference(AIAA). San Diego,USA:AIAA,1996.
[6]黄旭,柳嘉润,贾晨辉,等. 深度确定性策略梯度算法用于无人飞行器控制[J]. 航空学报,2021,42(11):397-407.
[7]余洋. 面向飞行器自主着舰问题的行动者-评论家算法模型研究与实现[D]. 北京:北京交通大学,2019.
[8]吴昭欣,李辉,王壮,等. 基于深度强化学习的智能仿真平台设计[J]. 战术导弹技术,2020(4):193-200.
[9]方振平,陈万春,张曙光. 航空飞行器飞行动力学[M]. 北京:北京航空航天大学出版社,2005.
[10]GARNETT T S. Investigation to study the aerodynamic ship wake turbulence generated by a DD963 Destroyer:ADA083663[R]. Philadelphia,USA:Boeing Vetrol Co.,1979.
[11]COLAS C,SIGAUD O,OUDEYER P Y. GEP-PG:decoupling exploration and exploitation in deep reinforcement learning algorithms[C]//Proceedings of the 35th International Conference on Machine Learning. Stockholm,Sweden:PMLR,2018.
[12]LILLICRAP T P,HUNT J J,PRITZEL A,et al. Continuous control with deep reinforcement learning[J]. Computer Science,2015,8(6):A187.
[13]焦李成,赵进,杨淑媛,等. 深度学习、优化与识别[M]. 北京:清华大学出版社,2017.
[14]谭浪. 强化学习在多智能体对抗中的应用研究[D]. 北京:中国运载火箭技术研究院,2019.
[15]SUTTONR S,BARTO A G. Reinforcement learning:an introduction[M]. Cambridge,England:MIT Press,2018.

Memo

Memo:
-
Last Update: 2022-09-15