• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Dai, Gui-Ping (Dai, Gui-Ping.) | Tang, Hao (Tang, Hao.) | Xi, Hong-Sheng (Xi, Hong-Sheng.)

收录:

EI Scopus PKU CSCD

摘要:

Based on the theory of performance potentials and the method of equivalent Markov process, the performance optimization problem is discussed for a class of semi-Markov decision processes (SMDPs) with parameterized randomized stationary policies and a simulation optimization algorithm is proposed. Firstly, a uniform Markov chain is defined through the equivalent Markov process. Secondly, the gradient of the average cost performance with respect to the policy parameters is then estimated by simulating a single sample path of the uniformized Markov chain, so that an optimal (or suboptimal) randomized stationary policy can be found by iterating the parameters. The derived algorithm can meet the requirements of performance optimization of many different systems with large-scale state space, an artificial neural network is also used to approximate the parameterized randomized stationary policies and avoid the curse of dimensionality. Finally, convergence of the algorithm with probability one on an infinite sample path is considered, and a numerical example is provided to illustrate the application of the algorithm.

关键词:

Optimization Neural networks Markov processes Convergence of numerical methods Probability Computer simulation Dynamic programming

作者机构:

  • [ 1 ] [Dai, Gui-Ping]College of Electronic and Control Engineering, Beijing University of Technology, Beijing 100022, China
  • [ 2 ] [Dai, Gui-Ping]Department of Automation, University of Science and Technology of China, Hefei 230027, China
  • [ 3 ] [Tang, Hao]Department of Computer, Hefei University of Technology, Hefei 230009, China
  • [ 4 ] [Xi, Hong-Sheng]Department of Automation, University of Science and Technology of China, Hefei 230027, China

通讯作者信息:

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

Control Theory and Applications

ISSN: 1000-8152

年份: 2006

期: 4

卷: 23

页码: 547-551

被引次数:

WoS核心集被引频次: 0

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:412/3909430
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司