Imitation Learning from Suboptimal Demonstrations via Discriminator Weighting - Details

Author：

He, Liuyuan (He, Liuyuan.) | Zuo, Guoyu (Zuo, Guoyu.) | Li, Jiangeng (Li, Jiangeng.) | Yu, Shuangyue (Yu, Shuangyue.)

Indexed by：

EI Scopus

Abstract：

Imitation　learning　algorithms　for　robotics　applications　require　sufficient　optimal　data　to　learn　well-performing　strategies.　State-of-the-art　approaches　utilize　pre-labeled　data　or　interaction　with　the　environment　to　filter　suboptimal　data,　which　is　time-consuming　and　laborious　in　reality.　In　this　paper,　we　propose　a　new　approach　that　avoids　manual　labeling　or　environment　interaction.　We　design　an　additional　discriminator　for　the　behavioral　cloning　approach　to　distinguish　the　optimal　and　suboptimal　data　in　order　to　influence　policy　learning　and　avoid　suboptimal　behaviors.　Within　this　framework,　we　design　a　new　imitation　learning　algorithm　that　utilizes　the　output　of　the　discriminator　as　weights　to　learn　efficiently　on　datasets　containing　suboptimal　data.　We　evaluate　the　performance　of　the　proposed　method　in　four　environments　and　compare　it　with　three　benchmark　methods.　The　results　illustrate　that　our　method　has　better　performance　when　dealing　with　datasets　containing　suboptimal　data.　The　method　we　proposed　can　distinguish　data　with　higher　values　in　the　dataset　and　enable　the　agent　to　learn　high-performance　policy　from　imperfect　demonstrations　or　a　small　amount　of　data.　©　2024　IEEE.

Keyword：

Benchmarking Discriminators Learning algorithms Clone cells Cloning

Author Community：

[ 1 ] [He, Liuyuan]Beijing University Of Technology, Faculty Of Information Technology, Beijing; 100124, China
[ 2 ] [Zuo, Guoyu]Beijing University Of Technology, Faculty Of Information Technology, Beijing; 100124, China
[ 3 ] [Li, Jiangeng]Beijing University Of Technology, Faculty Of Information Technology, Beijing; 100124, China
[ 4 ] [Yu, Shuangyue]Beijing University Of Technology, Faculty Of Information Technology, Beijing; 100124, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

A fast and efficient two-phase sequential learning algorithm for spatial architecture neural network
2014，Applied Soft Computing Journal
Artificial bee colony algorithm based on inductive pheromone updating and diffusion
2013，Computer Research and Development
A Self-Organizing Modular Neural Network for Nonlinear System Modeling
2020，2020 International Joint Conference on Neural Networks, IJCNN 2020
Mouse-Brain Topology Improved Evolutionary Neural Network for Efficient Reinforcement Learning
2022，5th IFIP TC 12 International Conference on Intelligence Science, ICIS 2022

Source ：

Year： 2024

Page： 5566-5571

Language： English

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 3

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to