Authors:

Wang, Ding (王鼎) | Ren, Jin | Ha, Mingming | Qiao, Junfei

Indexed in:

EI, Scopus, SCIE

Abstract:

For discounted optimal regulation design, the stability of the controlled system is affected by the discount factor. If an inappropriate discount factor is employed, the optimal control policy might be unstabilizing. Therefore, in this article, the effect of the discount factor on the stabilization of control strategies is discussed. We develop the system stability criterion and the selection rules of the discount factor with respect to the linear quadratic regulator problem under the general discounted value iteration algorithm. Based on the monotonicity of the value function sequence, the method to judge the stability of the controlled system is established during the iteration process. In addition, once some stability conditions are satisfied at a certain iteration step, all control policies after this iteration step are stabilizing. Furthermore, combined with the undiscounted optimal control problem, the practical rule of how to select an appropriate discount factor is constructed. Finally, several simulation examples with physical backgrounds are conducted to demonstrate the present theoretical results.
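
The effect described in the abstract can be illustrated with a minimal sketch. It assumes a discrete-time linear system x_{k+1} = A x_k + B u_k with cost sum_k gamma^k (x'Qx + u'Ru), and it uses the standard discounted Riccati recursion started from a zero value function, which is not necessarily the exact general value iteration scheme of the paper. The function names (`gain`, `discounted_lqr_vi`, `closed_loop_radius`) and the scalar example system are hypothetical and chosen only for illustration.

```python
import numpy as np

def gain(A, B, R, P, gamma):
    # Greedy feedback gain for value matrix P; the policy is u = -K x.
    return gamma * np.linalg.solve(R + gamma * B.T @ P @ B, B.T @ P @ A)

def discounted_lqr_vi(A, B, Q, R, gamma, iters=500):
    # Discounted value iteration, starting from the zero value function.
    P = np.zeros_like(Q, dtype=float)
    for _ in range(iters):
        K = gain(A, B, R, P, gamma)
        P = Q + gamma * A.T @ P @ A - gamma * A.T @ P @ B @ K
    return P, gain(A, B, R, P, gamma)

def closed_loop_radius(A, B, K):
    # Spectral radius of A - B K; a value below 1 means the policy stabilizes.
    return max(abs(np.linalg.eigvals(A - B @ K)))

# Hypothetical open-loop unstable scalar system (an assumption, not from the paper).
A = np.array([[2.0]])
B = np.array([[1.0]])
Q = np.array([[1.0]])
R = np.array([[1.0]])

for gamma in (1.0, 0.1):
    P, K = discounted_lqr_vi(A, B, Q, R, gamma)
    print(gamma, closed_loop_radius(A, B, K))
```

For this scalar example, solving the fixed point by hand gives a closed-loop pole of roughly 0.38 for gamma = 1 and roughly 1.73 for gamma = 0.1, so the heavily discounted policy is optimal for its own cost yet fails to stabilize the system.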

Keywords:

Regulators; Stability criteria; reinforcement learning (RL); discount factor; Costs; Adaptive critic design; optimal control; Asymptotic stability; stability; Heuristic algorithms; value iteration (VI); linear quadratic regulator (LQR); Cost function

Author affiliations:

  • [ 1 ] [Wang, Ding]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 2 ] [Ren, Jin]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 3 ] [Qiao, Junfei]Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Computat Intelligence & Intellige, Beijing Lab Smart Environm Protect, Beijing 100124, Peoples R China
  • [ 4 ] [Wang, Ding]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 5 ] [Ren, Jin]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 6 ] [Qiao, Junfei]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Beijing 100124, Peoples R China
  • [ 7 ] [Ha, Mingming]Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Beijing 100083, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

ISSN: 2162-237X

Year: 2022

Issue: 9

Volume: 34

Pages: 6504-6514

10.4 (JCR@2022)

ESI subject: COMPUTER SCIENCE

ESI highly cited threshold: 46

JCR quartile: 1

Chinese Academy of Sciences (CAS) journal tier: 1

Citation counts:

WoS Core Collection citations: 41

Scopus citations: 52

ESI highly cited paper listings: 0
