• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Liang, Mingming (Liang, Mingming.) | Wang, Ding (Wang, Ding.) (学者:王鼎) | Liu, Derong (Liu, Derong.)

收录:

EI SCIE

摘要:

In this paper, a novel policy iteration adaptive dynamic programming (ADP) algorithm is presented which is called "local policy iteration ADP algorithm" to obtain the optimal control for discrete stochastic processes. In the proposed local policy iteration ADP algorithm, the iterative decision rules are updated in a local space of the whole state space. Hence, we can significantly reduce the computational burden for the CPU in comparison with the conventional policy iteration algorithm. By analyzing the convergence properties of the proposed algorithm, it is shown that the iterative value functions are monotonically nonincreasing. Besides, the iterative value functions can converge to the optimum in a local policy space. In addition, this local policy space will be described in detail for the first time. Under a few weak constraints, it is also shown that the iterative value function will converge to the optimal performance index function of the global policy space. Finally, a simulation example is presented to validate the effectiveness of the developed method.

关键词:

Adaptive critic designs adaptive dynamic programming (ADP) Aerospace electronics Dynamical systems Heuristic algorithms Iterative algorithms local policy iteration neuro-dynamic programming optimal control Optimal control Performance analysis stochastic processes Stochastic processes

作者机构:

  • [ 1 ] [Liang, Mingming]Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
  • [ 2 ] [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Wang, Ding]Beijing Univ Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
  • [ 4 ] [Liu, Derong]Guangdong Univ Technol, Sch Automat, Guangzhou 510006, Peoples R China

通讯作者信息:

  • 王鼎

    [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

查看成果更多字段

相关关键词:

来源 :

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS

ISSN: 2168-2216

年份: 2020

期: 11

卷: 50

页码: 3972-3985

8 . 7 0 0

JCR@2022

ESI学科: ENGINEERING;

ESI高被引阀值:28

JCR分区:1

被引次数:

WoS核心集被引频次: 11

SCOPUS被引频次: 8

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 2

在线人数/总访问数:5438/2938007
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司