• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Zuo, Guoyu (Zuo, Guoyu.) (学者:左国玉) | Zhao, Qishen (Zhao, Qishen.) | Lu, Jiahao (Lu, Jiahao.) | Li, Jiangeng (Li, Jiangeng.)

收录:

EI Scopus SCIE

摘要:

The goal of reinforcement learning is to enable an agent to learn by using rewards. However, some robotic tasks naturally specify with sparse rewards, and manually shaping reward functions is a difficult project. In this article, we propose a general and model-free approach for reinforcement learning to learn robotic tasks with sparse rewards. First, a variant of Hindsight Experience Replay, Curious and Aggressive Hindsight Experience Replay, is proposed to improve the sample efficiency of reinforcement learning methods and avoid the need for complicated reward engineering. Second, based on Twin Delayed Deep Deterministic policy gradient algorithm, demonstrations are leveraged to overcome the exploration problem and speed up the policy training process. Finally, the action loss is added into the loss function in order to minimize the vibration of output action while maximizing the value of the action. The experiments on simulated robotic tasks are performed with different hyperparameters to verify the effectiveness of our method. Results show that our method can effectively solve the sparse reward problem and obtain a high learning speed.

关键词:

CAHER Robot learning sparse reward demonstrations reinforcement learning

作者机构:

  • [ 1 ] [Zuo, Guoyu]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Zhao, Qishen]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Lu, Jiahao]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 4 ] [Li, Jiangeng]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 5 ] [Zuo, Guoyu]Beijing Key Lab Comp Intelligence & Intelligent S, Beijing, Peoples R China
  • [ 6 ] [Zhao, Qishen]Beijing Key Lab Comp Intelligence & Intelligent S, Beijing, Peoples R China
  • [ 7 ] [Lu, Jiahao]Beijing Key Lab Comp Intelligence & Intelligent S, Beijing, Peoples R China
  • [ 8 ] [Li, Jiangeng]Beijing Key Lab Comp Intelligence & Intelligent S, Beijing, Peoples R China

通讯作者信息:

  • 左国玉

    [Zuo, Guoyu]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS

ISSN: 1729-8814

年份: 2020

期: 1

卷: 17

2 . 3 0 0

JCR@2022

ESI学科: ENGINEERING;

ESI高被引阀值:115

被引次数:

WoS核心集被引频次: 11

SCOPUS被引频次: 18

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:154/4525126
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司