Improved value iteration for neural-network-based stochastic optimal control design - Details

Author：

Liang, Mingming (Liang, Mingming.) | Wang, Ding (Wang, Ding.) (Scholars：王鼎) | Liu, Derong (Liu, Derong.)

Indexed by：

EI Scopus SCIE PubMed

Abstract：

In　this　paper,　a　novel　value　iteration　adaptive　dynamic　programming　(ADP)　algorithm　is　presented,　which　is　called　an　improved　value　iteration　ADP　algorithm,　to　obtain　the　optimal　policy　for　discrete　stochastic　processes.　In　the　improved　value　iteration　ADP　algorithm,　for　the　first　time　we　propose　a　new　criteria　to　verify　whether　the　obtained　policy　is　stable　or　not　for　stochastic　processes.　By　analyzing　the　convergence　properties　of　the　proposed　algorithm,　it　is　shown　that　the　iterative　value　functions　can　converge　to　the　optimum.　In　addition,　our　algorithm　allows　the　initial　value　function　to　be　an　arbitrary　positive　semi-definite　function.　Finally,　two　simulation　examples　are　presented　to　validate　the　effectiveness　of　the　developed　method.　(C)　2020　Elsevier　Ltd.　All　rights　reserved.

Keyword：

Adaptive dynamic programming Adaptive critic designs Optimal control Neural networks Value iteration Stochastic processes

Author Community：

[ 1 ] [Liang, Mingming]Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
[ 2 ] [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 3 ] [Wang, Ding]Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[ 4 ] [Liu, Derong]Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China

Reprint Author's Address：

王鼎
[Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

Email：

liangmingming2015@ia.ac.cn |
dingwang@bjut.edu.cn |
derong@gdut.edu.cn

Show more details

Related Keywords：

Evolution-guided value iteration for optimal tracking control
2024，NEUROCOMPUTING
Neural-network-based discounted optimal control via an integrated value iteration with accuracy guarantee
2021，NEURAL NETWORKS
Generalized value iteration for discounted optimal control with stability analysis
2021，SYSTEMS & CONTROL LETTERS
Improved Adaptive Critic for Neural Optimal Control of Constrained Nonlinear Discrete-Time Systems
2020，39th Chinese Control Conference (CCC)

Source ：

NEURAL NETWORKS

ISSN： 0893-6080

Year： 2020

Volume： 124

Page： 280-295

7 . 8 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：132

Cited Count：

WoS CC Cited Count： 15

SCOPUS Cited Count： 18

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to