Accomplishing robot grasping task rapidly via adversarial training

Authors:

Zuo, Guoyu (左国玉) | Lu, Jiahao | Chen, Kexin | Yu, Jianjun | Huang, Xiangsheng

Abstract:

This paper proposes a robotic imitation learning method that integrates deterministic off-policy reinforcement learning with a generative adversarial network. The method allows a robot to accomplish grasping tasks rapidly by learning the reward function from demonstration data. First, a discriminator learns the reward function from demonstrations, and this reward guides the generator toward completing the grasping task. Second, the deep deterministic policy gradient (DDPG) method serves as the generator and learns the action policy on the basis of the discriminator's output. In particular, the demonstration data is also fed into the generator to ensure its performance. Finally, three experiments on the Push and Pick-and-Place tasks are conducted in the Gym robotic environment. Results show that the proposed method learns much faster than the stochastic GAIL method and that it can train effectively from demonstration data collected in different states of the task. The proposed method can quickly complete the robot grasping task without an environmental reward and improves the stability of the training process. © 2018 IEEE
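
The discriminator-as-reward idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the linear logistic discriminator, the 2-D feature vectors standing in for (state, action) pairs, the toy data, and the reward form -log(1 - D), a common choice in GAIL-style methods, are all assumptions made for illustration. The DDPG generator is omitted; in a full system it would be trained on the returned reward in place of an environment reward.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


class LinearDiscriminator:
    """Hypothetical logistic discriminator D(x) in (0, 1); 1 means 'demonstration-like'."""

    def __init__(self, dim, lr=0.1):
        self.w = [0.0] * dim
        self.b = 0.0
        self.lr = lr

    def prob(self, x):
        return sigmoid(sum(wi * xi for wi, xi in zip(self.w, x)) + self.b)

    def update(self, demo_batch, policy_batch):
        # One gradient-ascent step on the GAN discriminator objective:
        # mean log D(demo) + mean log(1 - D(policy)).
        grad_w = [0.0] * len(self.w)
        grad_b = 0.0
        for batch, label in ((demo_batch, 1.0), (policy_batch, 0.0)):
            for x in batch:
                err = (label - self.prob(x)) / len(batch)
                for i, xi in enumerate(x):
                    grad_w[i] += err * xi
                grad_b += err
        self.w = [wi + self.lr * g for wi, g in zip(self.w, grad_w)]
        self.b += self.lr * grad_b

    def reward(self, x, eps=1e-8):
        # Surrogate reward (assumed form) used instead of an environment reward.
        return -math.log(1.0 - self.prob(x) + eps)


# Toy data (assumption): demonstration features cluster near (1, 1),
# features produced by an untrained policy cluster near (-1, -1).
rng = random.Random(0)
demo = [[1.0 + rng.gauss(0, 0.1), 1.0 + rng.gauss(0, 0.1)] for _ in range(32)]
policy = [[-1.0 + rng.gauss(0, 0.1), -1.0 + rng.gauss(0, 0.1)] for _ in range(32)]

disc = LinearDiscriminator(dim=2)
for _ in range(200):
    disc.update(demo, policy)

demo_like_reward = disc.reward([1.0, 1.0])
policy_like_reward = disc.reward([-1.0, -1.0])
```

After training, inputs that resemble demonstrations receive a higher surrogate reward than inputs that resemble the untrained policy, which is the signal a deterministic off-policy learner would then maximize.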

Keywords:

Educational robots; Reinforcement learning; Robot learning; Robotics; Stochastic systems; Demonstrations; Robots; Agricultural robots; Gradient methods

Author Affiliations:

[ 1 ] [Zuo, Guoyu]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 2 ] [Zuo, Guoyu]Beijing Key Laboratory of Computing Intelligence and Intelligent Systems, Beijing; 100124, China
[ 3 ] [Lu, Jiahao]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 4 ] [Lu, Jiahao]Beijing Key Laboratory of Computing Intelligence and Intelligent Systems, Beijing; 100124, China
[ 5 ] [Chen, Kexin]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 6 ] [Chen, Kexin]Beijing Key Laboratory of Computing Intelligence and Intelligent Systems, Beijing; 100124, China
[ 7 ] [Yu, Jianjun]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 8 ] [Yu, Jianjun]Beijing Key Laboratory of Computing Intelligence and Intelligent Systems, Beijing; 100124, China
[ 9 ] [Huang, Xiangsheng]Institute of Automation, Chinese Academy of Sciences, Beijing; 100190, China

Reprint Author's Address:

Zuo, Guoyu (左国玉)
[Zuo, Guoyu] Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China; [Zuo, Guoyu] Beijing Key Laboratory of Computing Intelligence and Intelligent Systems, Beijing; 100124, China

Email:

zuoguoyu@bjut.edu.cn
