Sparse Reward Based Manipulator Motion Planning by Using High Speed Learning from Demonstrations - Details

Author：

Zuo, Guoyu (Zuo, Guoyu.) (Scholars：左国玉) | Lu, Jiahao (Lu, Jiahao.) | Pan, Tingting (Pan, Tingting.)

Indexed by：

EI Scopus

Abstract：

This　paper　proposed　a　high　speed　learning　from　demonstrations　(LfD)　method　for　sparse　reward　based　motion　planning　problem　of　manipulator　by　using　hindsight　experience　replay　(HER)　mechanism　and　deep　deterministic　policy　gradient　(DDPG)　method.　First,　a　demonstrations　replay　buffer　and　an　agent　exploration　replay　buffer　are　created　for　storing　experience　data,　and　the　hindsight　experience　replay　mechanism　is　subsequently　used　to　acquire　the　experience　data　from　the　two　replay　buffers.　Then,　the　deep　deterministic　policy　gradient　method　is　used　to　learn　the　experience　data　and　finally　fulfil　the　manipulator　motion　planning　tasks　under　the　sparse　reward.　Last,　experiments　on　the　pushing　and　pick-and-place　tasks　were　conducted　in　the　robotics　environment　in　the　gym.　Results　show　that　the　training　speed　is　increased　to　at　least　10　times　as　compared　to　the　deep　deterministic　policy　gradient　method　without　demonstrations　data.　In　addition,　the　proposed　method　can　effectively　utilize　the　sparse　reward,　and　the　agent　can　quickly　complete　the　task　even　under　the　low　success　rate　of　demonstrations　data.　©　2018　IEEE.

Keyword：

Robot programming Biomimetics Robotics Manipulators Demonstrations User experience Motion planning Agricultural robots Gradient methods

Author Community：

[ 1 ] [Zuo, Guoyu]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 2 ] [Lu, Jiahao]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 3 ] [Pan, Tingting]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

A revised Gaussian distribution sampling scheme based on RRT∗ algorithms in robot motion planning
2017，3rd International Conference on Control, Automation and Robotics, ICCAR 2017
Research and implementation of robot arm task imitation system based on RNN
2017，2017 IEEE International Conference on Robotics and Biomimetics, ROBIO 2017
Human Motion Prediction Based on Visual Tracking
2019，4th International Conference on Robotics and Automation Engineering, ICRAE 2019
Study of human-like locomotion for humanoid robot based on human motion capture data
2016，2016 IEEE International Conference on Robotics and Biomimetics, ROBIO 2016

Source ：

Year： 2018

Page： 518-523

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 2

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 4

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to