• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Tao Junyuan (Tao Junyuan.) | Li Desheng (Li Desheng.) (学者:李德胜)

收录:

CPCI-S

摘要:

RoboCup offers a set of challenges for machine learning researchers because it is a dynamic, nondeterministic, goal delayed and continuous state space problem. Reinforcement learning (RL) is often used for strategy learning in RoboCup, which is a method to learn an optimal control policy for sequential decision-making problems. But it is difficult to apply RL, to continuous state space problems because of the exponential growth of states in the number of state variables. An effective method is to combine RL with function approximation. However, this combination sometimes leads to diverge. In this paper, we analyze the main reason that cause the non-convergent of the current approximation RL algorithms and propose an optimal strategy learning method. The two processes - value evaluation and policy improvement in RL have been separated. Policy search process is controlled strictly in the direction of improving performance according the evaluation value provided by the value function. And we apply this algorithm to a standard RoboCup sub-problem-Keepaway successfully. Experiment result has verified the effective of the method and showed the algorithm could converge to a local optimal policy.

关键词:

function approximation optimal control policy reinforcement learning RoboCup

作者机构:

  • [ 1 ] [Tao Junyuan]Harbin Inst Technol, Dept Automat Measurement & Control, Harbin 150006, Heilingjiang, Peoples R China
  • [ 2 ] [Li Desheng]Beijing Univ Technol, Dept Elect Mech Engn, Beijing, Peoples R China

通讯作者信息:

  • [Tao Junyuan]Harbin Inst Technol, Dept Automat Measurement & Control, Harbin 150006, Heilingjiang, Peoples R China

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

IEEE ICMA 2006: PROCEEDING OF THE 2006 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS 1-3, PROCEEDINGS

年份: 2006

页码: 301-,

语种: 英文

被引次数:

WoS核心集被引频次: 1

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 2

在线人数/总访问数:5542/2967507
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司