• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Wang, Jiangyu (Wang, Jiangyu.) | Wang, Ding (Wang, Ding.) | Zhao, Mingming (Zhao, Mingming.) | Qiao, Junfei (Qiao, Junfei.)

收录:

EI Scopus

摘要:

When facing large amounts of data, it is a challenging task to optimize policies by using all data at once. In this paper, a data-driven Q-learning scheme with parallel multi-step deduction is developed to improve learning efficiency using small batch data for discrete-time nonlinear control. Specifically, a data-driven model is established by making use of all data in advance. Then, the proposed algorithm can parallel deduce the small batch data to effectively accelerate the learning process. Furthermore, we can adjust the step size of multi-step deduction to balance the utilization between data and model. The near-optimal policy can be obtained ultimately by using hybrid data from the real system and data-driven model. Finally, a torsional pendulum plant is given to demonstrate the effectiveness of the proposed method. © 2024 IEEE.

关键词:

Federated learning Adversarial machine learning Contrastive Learning

作者机构:

  • [ 1 ] [Wang, Jiangyu]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 2 ] [Wang, Jiangyu]Beijing University of Technology, Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing; 100124, China
  • [ 3 ] [Wang, Jiangyu]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing; 100124, China
  • [ 4 ] [Wang, Jiangyu]Beijing University of Technology, Beijing Laboratory of Smart Environmental Protection, Beijing; 100124, China
  • [ 5 ] [Wang, Ding]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 6 ] [Wang, Ding]Beijing University of Technology, Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing; 100124, China
  • [ 7 ] [Wang, Ding]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing; 100124, China
  • [ 8 ] [Wang, Ding]Beijing University of Technology, Beijing Laboratory of Smart Environmental Protection, Beijing; 100124, China
  • [ 9 ] [Zhao, Mingming]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 10 ] [Zhao, Mingming]Beijing University of Technology, Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing; 100124, China
  • [ 11 ] [Zhao, Mingming]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing; 100124, China
  • [ 12 ] [Zhao, Mingming]Beijing University of Technology, Beijing Laboratory of Smart Environmental Protection, Beijing; 100124, China
  • [ 13 ] [Qiao, Junfei]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
  • [ 14 ] [Qiao, Junfei]Beijing University of Technology, Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing; 100124, China
  • [ 15 ] [Qiao, Junfei]Beijing Institute of Artificial Intelligence, Beijing University of Technology, Beijing; 100124, China
  • [ 16 ] [Qiao, Junfei]Beijing University of Technology, Beijing Laboratory of Smart Environmental Protection, Beijing; 100124, China

通讯作者信息:

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

年份: 2024

页码: 739-744

语种: 英文

被引次数:

WoS核心集被引频次:

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:683/5059841
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司