Dichotomy value iteration with parallel learning design towards discrete-time zero-sum games - Details

Author：

Wang, Jiangyu (Wang, Jiangyu.) | Wang, Ding (Wang, Ding.) (Scholars：王鼎) | Li, Xin (Li, Xin.) | Qiao, Junfei (Qiao, Junfei.)

Indexed by：

EI Scopus SCIE

Abstract：

In　this　paper,　a　novel　parallel　learning　framework　is　developed　to　solve　zero-sum　games　for　discrete　-time　nonlinear　systems.　Briefly,　the　purpose　of　this　study　is　to　determine　a　tentative　function　according　to　the　prior　knowledge　of　the　value　iteration　(VI)　algorithm.　The　learning　process　of　the　parallel　controllers　can　be　guided　by　the　tentative　function.　That　is　to　say,　the　neighborhood　of　the　optimal　cost　function　can　be　compressed　within　a　small　range　via　two　typical　exploration　policies.　Based　on　the　parallel　learning　framework,　a　novel　dichotomy　VI　algorithm　is　established　to　accelerate　the　learning　speed.　It　is　shown　that　the　parallel　controllers　will　converge　to　the　optimal　policy　from　contrary　initial　policies.　Finally,　two　typical　systems　are　used　to　demonstrate　the　learning　performance　of　the　constructed　dichotomy　VI　algorithm.(c)　2023　Elsevier　Ltd.　All　rights　reserved.

Keyword：

Zero -sum games Value iteration Artificial neural networks Nonlinear systems Parallel learning Adaptive critic

Author Community：

[ 1 ] [Wang, Jiangyu]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 2 ] [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 3 ] [Li, Xin]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 4 ] [Qiao, Junfei]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 5 ] [Wang, Jiangyu]Beijing Univ Technol, Key Lab Computat Intelligence & Intelligent Syst, Beijing 100124, Peoples R China
[ 6 ] [Li, Xin]Beijing Univ Technol, Key Lab Computat Intelligence & Intelligent Syst, Beijing 100124, Peoples R China
[ 7 ] [Qiao, Junfei]Beijing Univ Technol, Key Lab Computat Intelligence & Intelligent Syst, Beijing 100124, Peoples R China
[ 8 ] [Wang, Jiangyu]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
[ 9 ] [Li, Xin]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
[ 10 ] [Qiao, Junfei]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
[ 11 ] [Wang, Jiangyu]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
[ 12 ] [Li, Xin]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
[ 13 ] [Qiao, Junfei]Beijing Univ Technol, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China

Reprint Author's Address：

Email：

wangjiangyu@emails.bjut.edu.cn |
dingwang@bjut.edu.cn |
lixin229038@emails.bjut.edu.cn |
adqiao@bjut.edu.cn

Show more details

Related Keywords：

Discounted Near-Optimal Control of Affine Systems via a Progressive Cost Evolution Formulation
2023，IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS
Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control
2022，IEEE-CAA JOURNAL OF AUTOMATICA SINICA
Discounted near-optimal regulation of constrained nonlinear systems via generalized value iteration
2021，INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL
Safe Q-Learning for Data-Driven Nonlinear Optimal Control with Asymmetric State Constraints
2024，IEEE-CAA JOURNAL OF AUTOMATICA SINICA

Source ：

NEURAL NETWORKS

ISSN： 0893-6080

Year： 2023

Volume： 167

Page： 751-762

7 . 8 0 0

JCR@2022

Cited Count：

WoS CC Cited Count： 4

SCOPUS Cited Count： 5

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to