Offline and Online Adaptive Critic Control Designs With Stability Guarantee Through Value Iteration

Author：

Ha, Mingming | Wang, Ding (王鼎) | Liu, Derong

Indexed by：

EI Scopus SCIE

Abstract：

This article is concerned with the stability of the closed-loop system under various control policies generated by value iteration (VI). Stability properties involving admissibility criteria, the attraction domain, and so forth are investigated. An offline integrated VI scheme with a stability guarantee is developed by combining the advantages of VI and policy iteration, which makes it convenient to obtain admissible control policies. Also, based on the concept of the attraction domain, an online adaptive dynamic programming (ADP) algorithm using immature control policies is developed. Remarkably, it is ensured that the state trajectory under the online algorithm converges to the origin. In particular, for linear systems, the online ADP algorithm with a general scheme possesses a stronger stability property. The theoretical results reveal that the stability of the linear system can be guaranteed even if the control policy sequence includes a finite number of unstable elements. The numerical results verify the effectiveness of the presented algorithms.

Keyword：

Heuristic algorithms; Stability criteria; Numerical stability; Cost function; Power system stability; Adaptive dynamic programming; Online adaptive critic control; Policy iteration (PI); Reinforcement learning (RL); Value iteration (VI); Trajectory; Asymptotic stability

Author Community：

[ 1 ] [Ha, Mingming]Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China
[ 2 ] [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 3 ] [Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[ 4 ] [Liu, Derong]Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA

Reprint Author's Address：

Email：

hamingming_0705@foxmail.com |
dingwang@bjut.edu.cn |
derong@uic.edu


Related Keywords：

System Stability of Learning-Based Linear Optimal Control With General Discounted Value Iteration
2022，IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
A Novel Value Iteration Scheme With Adjustable Convergence Rate
2022，IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
Decentralized Optimal Neurocontroller Design for Mismatched Interconnected Systems via Integral Policy Iteration
2024，IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS
Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control
2022，IEEE-CAA JOURNAL OF AUTOMATICA SINICA

Source ：

IEEE TRANSACTIONS ON CYBERNETICS

ISSN： 2168-2267

Year： 2021

11.800

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：87

JCR Journal Grade：1

Cited Count：

WoS CC Cited Count： 54

SCOPUS Cited Count： 58

ESI Highly Cited Papers on the List： 0

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：
