Reinforcement learning function approximation algorithm based on linear average

Author：

Tao, Jun-Yuan | Sun, Jin-Wei | Li, De-Sheng (Scholars：李德胜)

Indexed by：

EI; Scopus; PKU; CSCD

Abstract：

A reinforcement learning algorithm based on linear average is proposed to address the non-convergence of function approximation in reinforcement learning over continuous state spaces. Drawing on contraction theory, the algorithm builds on the gradient descent method and adopts linear average as the performance measure for the value function, so that the iteration of the value function converges to a fixed value. The algorithm is evaluated on a standard reinforcement learning benchmark, the Mountain Car problem. The results show that the algorithm is effective and feasible and that it converges quickly.
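
This record does not give the paper's update rule, so the following is only a generic sketch of the convergence argument the abstract alludes to, not the authors' algorithm. It assumes a small finite-state problem with a fixed policy (not Mountain Car), and all names (`averaged_backup`, `random_stochastic_matrix`, `run`) are hypothetical. The idea illustrated: a linear average with non-negative weights that sum to 1 cannot increase the max-norm distance between two value estimates, so applying it after a discounted backup keeps the combined step a contraction with factor gamma, and the iteration settles on a single fixed value from any starting point.

```python
import random


def averaged_backup(values, rewards, transitions, weights, gamma):
    """One sweep: discounted backup followed by a linear average."""
    n = len(values)
    # Backup for a fixed policy: r(s) + gamma * sum over s' of P(s, s') * V(s')
    backed_up = [
        rewards[s] + gamma * sum(transitions[s][t] * values[t] for t in range(n))
        for s in range(n)
    ]
    # Linear average: each output is a convex combination of backed-up values
    return [
        sum(weights[s][t] * backed_up[t] for t in range(n))
        for s in range(n)
    ]


def random_stochastic_matrix(n, rng):
    """Rows are non-negative and sum to 1 (assumed form of the averaging weights)."""
    rows = []
    for _ in range(n):
        raw = [rng.random() for _ in range(n)]
        total = sum(raw)
        rows.append([x / total for x in raw])
    return rows


def max_norm_diff(a, b):
    return max(abs(x - y) for x, y in zip(a, b))


def run(n=6, gamma=0.9, sweeps=200, seed=0):
    rng = random.Random(seed)
    transitions = random_stochastic_matrix(n, rng)  # made-up dynamics
    weights = random_stochastic_matrix(n, rng)      # made-up averaging weights
    rewards = [rng.uniform(-1.0, 0.0) for _ in range(n)]
    v = [0.0] * n   # two deliberately different starting estimates
    w = [10.0] * n
    gaps = []
    for _ in range(sweeps):
        gaps.append(max_norm_diff(v, w))
        v = averaged_backup(v, rewards, transitions, weights, gamma)
        w = averaged_backup(w, rewards, transitions, weights, gamma)
    return v, w, gaps
```

Calling `run()` returns the two final estimates and the per-sweep gap between them. Under the assumptions above, the gap shrinks by at least a factor of `gamma` each sweep, which is the contraction property in miniature.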

Keyword：

Gradient methods; Reinforcement learning; Learning algorithms; Approximation algorithms; Automation

Author Community：

[1] [Tao, Jun-Yuan] School of Electrical Engineering and Automation, Harbin Institute of Technology, Harbin 150001, China
[2] [Sun, Jin-Wei] School of Electrical Engineering and Automation, Harbin Institute of Technology, Harbin 150001, China
[3] [Li, De-Sheng] School of Mechanical Engineering and Applied Electronic Technology, Beijing University of Technology, Beijing 100022, China

Reprint Author's Address：

Email：

tjy1975@126.com

Source ：

Journal of Jilin University (Engineering and Technology Edition)

ISSN： 1671-5497

Year： 2008

Issue： 6

Volume： 38

Page： 1407-1411

Affiliated Colleges：

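Faculty of Materials and Manufacturing: data not explicitly attributed to a college or department within the faculty

Faculty of Materials and Manufacturing, College of Mechanical Engineering and Applied Electronic Technology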
