Novel optimal trajectory tracking for nonlinear affine systems with an advanced critic learning structure - Details

Author：

Wang, Ding (Wang, Ding.) (Scholars：王鼎) | Zhao, Huiling (Zhao, Huiling.) | Zhao, Mingming (Zhao, Mingming.) | Ren, Jin (Ren, Jin.)

Indexed by：

EI Scopus SCIE

Abstract：

In　this　paper,　a　critic　learning　structure　based　on　the　novel　utility　function　is　developed　to　solve　the　optimal　tracking　control　problem　with　the　discount　factor　of　affine　nonlinear　systems.　The　utility　function　is　defined　as　the　quadratic　form　of　the　error　at　the　next　moment,　which　can　not　only　avoid　solving　the　stable　control　input,　but　also　effectively　eliminate　the　tracking　error.　Next,　the　theoretical　derivation　of　the　method　under　value　iteration　is　given　in　detail　with　convergence　and　stability　analysis.　Then,　the　dual　heuristic　dynamic　programming　(DHP)　algorithm　via　a　single　neural　network　is　introduced　to　reduce　the　amount　of　computation.　The　polynomial　is　used　to　approximate　the　costate　function　during　the　DHP　implementation.　The　weighted　residual　method　is　used　to　update　the　weight　matrix.　During　simulation,　the　convergence　speed　of　the　given　strategy　is　compared　with　the　heuristic　dynamic　programming　(HDP)　algorithm.　The　experiment　results　display　that　the　convergence　speed　of　the　proposed　method　is　faster　than　the　HDP　algorithm.　Besides,　the　proposed　method　is　compared　with　the　traditional　tracking　control　approach　to　verify　its　tracking　performance.　The　experiment　results　show　that　the　proposed　method　can　avoid　solving　the　stable　control　input,　and　the　tracking　error　is　closer　to　zero　than　the　traditional　strategy.　(C)　2022　Elsevier　Ltd.　All　rights　reserved.

Keyword：

Value iteration Polynomial Dual heuristic dynamic programming Optimal tracking control Discount factor Neural networks

Author Community：

[ 1 ] [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 2 ] [Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[ 3 ] [Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
[ 4 ] [Wang, Ding]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China

Reprint Author's Address：

Email：

dingwang@bjut.edu.cn |
zhaohuiling@emails.bjut.edu.cn |
zhaomm@emails.bjut.edu.cn |
renjin@emails.bjut.edu.cn

Show more details

Related Keywords：

Improved value iteration for neural-network-based stochastic optimal control design
2020，NEURAL NETWORKS
Dichotomy value iteration with parallel learning design towards discrete-time zero-sum games
2023，NEURAL NETWORKS
Neural critic learning with accelerated value iteration for nonlinear model predictive control
2024，NEURAL NETWORKS
Data-Based Nonaffine Optimal Tracking Control Using Iterative DHP Approach
2020，21st IFAC World Congress on Automatic Control - Meeting Societal Challenges

Source ：

NEURAL NETWORKS

ISSN： 0893-6080

Year： 2022

Volume： 154

Page： 131-140

7 . 8

JCR@2022

7 . 8 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：46

JCR Journal Grade：1

CAS Journal Grade：2

Cited Count：

WoS CC Cited Count： 2

SCOPUS Cited Count： 3

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to