Learning-Based N-Step Heuristic Dynamic Programming for Affine Nonlinear Optimal Regulation

Author：

Xin, Peng | Wang, Ding | Zhao, Mingming | Ha, Mingming | Ren, Jin

Indexed by：

EI Scopus

Abstract：

This paper introduces n-step heuristic dynamic programming (NSHDP), which combines regular temporal difference (TD) learning with TD(λ) learning, to solve optimal control problems. First, the implementation of the basic value iteration algorithm is presented. Then, building on the traditional HDP algorithm, the architecture of the NSHDP(λ) algorithm is described. Most importantly, a stability condition for the NSHDP(λ) algorithm is developed. Furthermore, the one-step critic network, the n-step critic network, and the action network are designed. Finally, the effectiveness of the proposed algorithm is verified by a simulation experiment. © 2022 Technical Committee on Control Theory, Chinese Association of Automation.
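
The difference between a one-step and an n-step value update mentioned in the abstract can be illustrated on a toy problem. The sketch below is not the paper's NSHDP(λ) algorithm and uses no critic or action networks. It assumes a scalar linear system x_{k+1} = a x_k + b u_k with stage cost q x² + r u² and a quadratic value estimate V(x) = p x². All names and parameter values (`greedy_gain`, `n_step_backup`, `iterate`, a = 1.2, and so on) are hypothetical choices made for illustration. With n = 1 the update reduces to ordinary value iteration; larger n accumulates n stage costs under the current greedy policy before bootstrapping from the old estimate.

```python
def greedy_gain(p, a, b, r):
    """Gain K of the greedy policy u = -K x for the estimate V(x) = p x^2."""
    return a * b * p / (r + b * b * p)


def n_step_backup(p, n, a, b, q, r):
    """One value update built from an n-step return under the greedy policy.

    Assumed toy setting: x_{k+1} = a x + b u, stage cost q x^2 + r u^2.
    Returns the new coefficient p' such that V'(x) = p' x^2.
    """
    k = greedy_gain(p, a, b, r)
    a_cl = a - b * k              # closed-loop dynamics under u = -K x
    stage = q + r * k * k         # stage cost per unit x^2 under u = -K x
    # Sum of n stage costs along the closed-loop trajectory from x = 1.
    running = sum(stage * a_cl ** (2 * j) for j in range(n))
    # Bootstrap from the old estimate at the state reached after n steps.
    return running + a_cl ** (2 * n) * p


def iterate(n, a=1.2, b=1.0, q=1.0, r=1.0, sweeps=50):
    """Repeat the n-step update starting from the zero value function."""
    p = 0.0
    for _ in range(sweeps):
        p = n_step_backup(p, n, a, b, q, r)
    return p


if __name__ == "__main__":
    for n in (1, 3):
        print(f"n = {n}: p = {iterate(n):.6f}")
```

For these assumed parameters both settings approach the same fixed point, the solution of the scalar Riccati equation; the n-step variant simply uses a longer look-ahead per update.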

Keyword：

Dynamic programming; Optimal control systems; Iterative methods; Neural networks; Heuristic programming

Author Community：

[ 1 ] [Xin, Peng]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 2 ] [Xin, Peng]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing; 100124, China
[ 3 ] [Xin, Peng]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing; 100124, China
[ 4 ] [Xin, Peng]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing; 100124, China
[ 5 ] [Wang, Ding]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 6 ] [Wang, Ding]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing; 100124, China
[ 7 ] [Wang, Ding]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing; 100124, China
[ 8 ] [Wang, Ding]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing; 100124, China
[ 9 ] [Zhao, Mingming]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 10 ] [Zhao, Mingming]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing; 100124, China
[ 11 ] [Zhao, Mingming]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing; 100124, China
[ 12 ] [Zhao, Mingming]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing; 100124, China
[ 13 ] [Ha, Mingming]School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing; 100083, China
[ 14 ] [Ren, Jin]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 15 ] [Ren, Jin]Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology, Beijing; 100124, China
[ 16 ] [Ren, Jin]Beijing Laboratory of Smart Environmental Protection, Beijing University of Technology, Beijing; 100124, China
[ 17 ] [Ren, Jin]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing; 100124, China

Related Keywords：

Improved Adaptive Critic for Neural Optimal Control of Constrained Nonlinear Discrete-Time Systems
2020，39th Chinese Control Conference, CCC 2020
Decentralized Tracking Control of Interconnected Nonlinear Systems with Unmatched External Disturbances
2024，43rd Chinese Control Conference, CCC 2024
Optimal Tracking Control for Constrained Two-Link Robot Based on Adaptive Dynamic Programming
2024，13th IEEE Data Driven Control and Learning Systems Conference, DDCLS 2024
Robust Policy Learning Control Design for Multiplayer Nonzero-Sum Games with Uncertainties
2023，42nd Chinese Control Conference, CCC 2023

Source ：

ISSN： 1934-1768

Year： 2022

Volume： 2022-July

Page： 2242-2247

Language： English
