Indexed by:
Abstract:
When facing large amounts of data, optimizing policies using all of the data at once is a challenging task. In this paper, a data-driven Q-learning scheme with parallel multi-step deduction is developed to improve learning efficiency with small-batch data for discrete-time nonlinear control. Specifically, a data-driven model is first established from all of the available data. The proposed algorithm then performs multi-step deduction on the small-batch data in parallel, which effectively accelerates the learning process. Furthermore, the step size of the multi-step deduction can be adjusted to balance the utilization of data and model. The near-optimal policy is ultimately obtained from hybrid data drawn from the real system and the data-driven model. Finally, a torsional pendulum plant is used to demonstrate the effectiveness of the proposed method. © 2024 IEEE.
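The paper itself is not reproduced here, but the idea the abstract describes, learning a data-driven model from real interactions and then deducing additional K-step updates inside that model to accelerate Q-learning, can be illustrated with a minimal Dyna-style tabular sketch. This is an assumption-laden analogue, not the authors' algorithm: the chain environment, the tabular `Q`, the dictionary `model`, and the deduction depth `K` are all invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy deterministic MDP: 5 states in a chain, actions 0 = left, 1 = right.
# Reaching the last state yields reward 1 and ends the episode.
N_STATES, N_ACTIONS = 5, 2

def step(s, a):
    s2 = min(s + 1, N_STATES - 1) if a == 1 else max(s - 1, 0)
    return s2, (1.0 if s2 == N_STATES - 1 else 0.0)

Q = np.zeros((N_STATES, N_ACTIONS))
model = {}                 # data-driven model: (s, a) -> (s', r), fitted from real data
alpha, gamma, eps = 0.5, 0.9, 0.1
K = 3                      # deduction depth: model-only rollout length per real step

for episode in range(200):
    s = 0
    while s != N_STATES - 1:
        # epsilon-greedy action on the real system
        a = int(rng.integers(N_ACTIONS)) if rng.random() < eps else int(np.argmax(Q[s]))
        s2, r = step(s, a)                 # one real interaction
        model[(s, a)] = (s2, r)            # update the learned model from real data
        Q[s, a] += alpha * (r + gamma * Q[s2].max() - Q[s, a])

        # K-step deduction: continue the trajectory inside the model only,
        # so each real sample yields several extra (hybrid-data) updates.
        ms = s2
        for _ in range(K):
            ma = int(np.argmax(Q[ms]))
            if (ms, ma) not in model:
                break                      # model has no data for this pair yet
            ms2, mr = model[(ms, ma)]
            Q[ms, ma] += alpha * (mr + gamma * Q[ms2].max() - Q[ms, ma])
            ms = ms2
        s = s2

# Greedy policy should move right in every non-goal state.
print([int(np.argmax(Q[s])) for s in range(N_STATES - 1)])
```

Setting `K = 0` recovers plain Q-learning on real data only, while larger `K` leans more on the learned model, mirroring the abstract's point that the deduction step size trades off data use against model use.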
Keywords:
Corresponding author information:
Email address:
Source:
Year: 2024
Pages: 739-744
Language: English
Affiliated department: