Indexed by:
Abstract:
In this paper, an adjustable Q-learning scheme is developed to solve the discrete-time nonlinear zero-sum game problem, which can accelerate the convergence rate of the iterative Q-function sequence. First, the monotonicity and convergence of the iterative Q-function sequence are analyzed under some conditions. Moreover, by employing neural networks, the model-free tracking control problem can be overcome for zero-sum games. Second, two practical algorithms are designed to guarantee convergence with accelerated learning. In one algorithm, an adjustable acceleration phase is added to the iteration process of Q-learning, which can be adaptively terminated with a convergence guarantee. In the other algorithm, a novel acceleration function is developed, which adjusts the relaxation factor to ensure convergence. Finally, through a simulation example with a practical physical background, the excellent performance of the developed algorithm is demonstrated with neural networks.
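The abstract does not give the paper's exact update rule, but the idea of accelerating zero-sum Q-iteration with a relaxation factor can be sketched as follows. This is a minimal illustration under stated assumptions: the toy two-state game, the minimax Bellman operator, and the relaxed update Q_{k+1} = (1 - beta) Q_k + beta T(Q_k) are all hypothetical choices, not the paper's algorithm.

```python
import numpy as np

# Illustrative sketch (not the paper's exact algorithm): zero-sum
# Q-iteration with a relaxation factor beta, on a hypothetical tiny
# game with 2 states and 2 actions per player.
rng = np.random.default_rng(0)
n_s, n_u, n_w = 2, 2, 2
r = rng.uniform(0.0, 1.0, size=(n_s, n_u, n_w))      # stage reward (maximizer's)
s_next = rng.integers(0, n_s, size=(n_s, n_u, n_w))  # deterministic transitions
gamma = 0.6                                          # discount factor

def bellman(Q):
    # Zero-sum Bellman operator: V(s) = max_u min_w Q(s, u, w),
    # then T(Q)(s, u, w) = r(s, u, w) + gamma * V(s').
    V = Q.min(axis=2).max(axis=1)
    return r + gamma * V[s_next]

def relaxed_q_iteration(beta, tol=1e-10, max_iter=10_000):
    # Q_{k+1} = (1 - beta) * Q_k + beta * T(Q_k).
    # beta = 1 recovers standard Q-iteration; beta > 1 (over-relaxation)
    # can accelerate monotone convergence. A sufficient condition for
    # convergence of this sketch is |1 - beta| + beta * gamma < 1.
    Q = np.zeros((n_s, n_u, n_w))
    for k in range(max_iter):
        Q_new = (1.0 - beta) * Q + beta * bellman(Q)
        if np.max(np.abs(Q_new - Q)) < tol:
            return Q_new, k + 1
        Q = Q_new
    return Q, max_iter

Q_std, iters_std = relaxed_q_iteration(beta=1.0)
Q_acc, iters_acc = relaxed_q_iteration(beta=1.2)  # |1-1.2| + 1.2*0.6 = 0.92 < 1
```

Both runs reach the same fixed point; the paper's contribution, per the abstract, is adjusting such a relaxation factor online so that acceleration never sacrifices convergence.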
Keywords:
Corresponding author:
E-mail address:
Source:
NEURAL NETWORKS
ISSN: 0893-6080
年份: 2024
卷: 175
Impact factor: 7.800 (JCR@2022)
Affiliated department: