• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Tao, Junyuan (Tao, Junyuan.) | Li, Desheng (Li, Desheng.) (学者:李德胜)

收录:

EI Scopus

摘要:

RoboCup offers a set of challenges for machine learning researchers because it is a dynamic, nondeterministic, goal delayed and continuous state space problem. Reinforcement learning (RL) is often used for strategy learning in RoboCup, which is a method to learn an optimal control policy for sequential decision-making problems. But it is difficult to apply RL to continuous state space problems because of the exponential growth of states in the number of state variables. An effective method is to combine RL with function approximation. However, this combination sometimes leads to diverge. In this paper, we analyze the main reason that cause the non-convergent of the current approximation RL algorithms and propose an optimal strategy learning method. The two processes - value evaluation and policy improvement in RL have been separated. Policy search process is controlled strictly in the direction of improving performance according the evaluation value provided by the value function. And we apply this algorithm to a standard RoboCup sub-problem-Keepaway successfully. Experiment result has verified the effective of the method and showed the algorithm could converge to a local optimal policy. ©2006 IEEE.

关键词:

Decision making Function evaluation Learning algorithms Problem solving Reinforcement learning Robotics State space methods

作者机构:

  • [ 1 ] [Tao, Junyuan]Department of Automatic Measurement and Control, Harbin Institute of Technology, Harbin, Heilongjiang Province, China
  • [ 2 ] [Li, Desheng]Department of Mechanical and Electronic Engineering, Beijing University of Technology, Beijing, China

通讯作者信息:

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

年份: 2006

卷: 2006

页码: 301-305

语种: 英文

被引次数:

WoS核心集被引频次: 0

SCOPUS被引频次: 2

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 2

在线人数/总访问数:1857/2973044
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司