Indexed by:
Abstract:
In this paper, a value-iteration-based off-policy Q-learning algorithm is developed. The proposed algorithm solves the optimal regulation problem for nonlinear systems with unknown dynamics. Under the off-policy mechanism, the algorithm uses a behavioral policy for full exploration, which helps prevent the target policy from converging to a local optimum. In addition, a relaxation factor is introduced to adjust the convergence rate of the cost function sequence. To implement the algorithm, a critic network and an action network are used to approximate the optimal Q-function and the optimal control policy, respectively. Finally, a simulation example is presented to demonstrate the effectiveness of the proposed algorithm. © 2024 IEEE.
Keywords:
Corresponding author:
Email:
Source:
Year: 2024
Pages: 2717-2722
Language: English
Affiliated department:
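The abstract describes a value-iteration-style Q-learning update whose convergence rate is tuned by a relaxation factor. A minimal sketch of that idea on a toy finite MDP is shown below; the two-state model, the reward values, and the specific relaxed update rule (a convex combination of successive Q-iterates weighted by a factor eta) are illustrative assumptions, not the paper's actual formulation, which targets nonlinear systems with unknown dynamics via critic and action networks.

```python
import numpy as np

def relaxed_q_value_iteration(P, R, gamma=0.9, eta=0.8, iters=500):
    """Value-iteration Q-learning with a relaxation factor (sketch).

    P : (n_s, n_a, n_s) transition probabilities (assumed known here,
        unlike the model-free setting in the paper)
    R : (n_s, n_a) one-step rewards
    eta in (0, 1] blends the old and new Q-iterates, adjusting the
    convergence rate of the Q-sequence.
    """
    n_s, n_a = R.shape
    Q = np.zeros((n_s, n_a))
    for _ in range(iters):
        # Standard Bellman optimality backup
        Q_new = R + gamma * (P @ Q.max(axis=1))
        # Relaxed update: eta = 1 recovers plain value iteration;
        # smaller eta slows (damps) the iteration.
        Q = (1.0 - eta) * Q + eta * Q_new
    return Q

# Toy 2-state, 2-action MDP (values chosen only for illustration)
P = np.array([[[0.9, 0.1], [0.2, 0.8]],
              [[0.7, 0.3], [0.05, 0.95]]])
R = np.array([[1.0, 0.0],
              [0.5, 2.0]])
Q = relaxed_q_value_iteration(P, R)
policy = Q.argmax(axis=1)  # greedy target policy from the learned Q
```

The effective contraction factor of the relaxed iteration is (1 - eta) + eta * gamma, so eta directly trades off per-step progress against damping, matching the abstract's claim that the relaxation factor adjusts the convergence rate of the cost function sequence.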