Stability Analysis of Model-Free Control under Iterative Q-learning Algorithms - Details

Author：

Zhao, Mingming (Zhao, Mingming.) | Wang, Ding (Wang, Ding.) | Qiao, Junfei (Qiao, Junfei.) | Gao, Ning (Gao, Ning.) | Xin, Peng (Xin, Peng.)

Indexed by：

EI Scopus

Abstract：

This　article　investigates　the　stability　of　the　closed-loop　system　under　the　iterative　control　policy　generated　by　various　iterative　Q-learning　algorithms.　First,　a　new　stability　criterion　is　developed　for　the　value-iteration-based　Q-learning　(VIQL)　algorithm,　which　is　initialized　by　a　positive　semi-definite　function.　Through　this　operation,　VIQL　can　provide　an　initial　admissible　control　policy　for　the　policy-iteration-based　Q-learning　(PIQL)　algorithm.　It　is　emphasized　that　evolving　control　policies　generated　by　PIQL　can　stabilize　the　controlled　system.　The　numerical　result　is　provided　to　verify　the　effectiveness　of　the　present　algorithms.　©　2023　IEEE.

Keyword：

Learning algorithms Iterative methods Stability criteria Dynamic programming Reinforcement learning Closed loop systems

Author Community：

[ 1 ] [Zhao, Mingming]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 2 ] [Wang, Ding]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 3 ] [Qiao, Junfei]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 4 ] [Gao, Ning]Beijing University of Technology, Faculty of Information Technology, Beijing, China
[ 5 ] [Xin, Peng]Beijing University of Technology, Faculty of Information Technology, Beijing, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Reinforcement Learning With Adjustable Convergence Rate for Data-Based Nonlinear Control
2024，36th Chinese Control and Decision Conference, CCDC 2024
Event-Driven Robust Guaranteed Cost Control via an Improved Adaptive Critic Learning Strategy
2022，4th International Conference on Industrial Artificial Intelligence, IAI 2022
A synthetic approach for robust constrained iterative learning control of piecewise affine batch processes
2012，Automatica
An improved cuckoo search algorithm for semiconductor final testing scheduling
2017，13th IEEE Conference on Automation Science and Engineering, CASE 2017

Source ：

Year： 2023

Page： 39-43

Language： English

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 3

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to