• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索
高影响力成果及被引频次趋势图 关键词云图及合作者关系图

您的检索:

学者姓名:王鼎

精炼检索结果:

成果类型

应用 展开

语言

应用

清除所有精炼条件

排序方式:
默认
  • 默认
  • 标题
  • 年份
  • WOS被引数
  • 影响因子
  • 正序
  • 倒序
< 页,共 4 >
Decentralised tracking control based on critic learning for nonlinear disturbed interconnected systems SCIE
期刊论文 | 2023 , 54 (5) , 1150-1164 | INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE
WoS核心集被引次数: 4
摘要&关键词 引用

摘要 :

In this paper, the decentralised tracking control (DTC) problem is investigated for a class of continuous-time large-scale systems with external disturbance by utilising adaptive dynamic programming (ADP). Firstly, the DTC problem is solved by designing corresponding optimal controllers of the isolated subsystems, which are formulated with N augmented subsystems consisting of the tracking error and the reference trajectory. Then, considering the external disturbance, we can effectively construct the DTC scheme by means of adding suitable feedback gains to the optimal control strategies associated with each augmented tracking isolated subsystems (ATISs). Due to the approximate nature, a series of critic neural networks are constructed to solve the Hamilton-Jacobi-Isaacs equation, so as to derive the estimation of the Nash equilibrium solution containing the optimal control strategy and the worst disturbance law. Herein, a modified weight updating criterion is developed by employing a stabilising term. Consequently, we remove the requirement of initial admissible control in the proposed algorithm. After that, stability analysis of the ATIS is performed through the Lyapunov theory, in the sense that tracking states and weight approximation errors are uniformly ultimately bounded. Finally, an experimental simulation is demonstrated to ensure the validity of the proposed DTC scheme.

关键词 :

optimal control optimal control Adaptive dynamic programming (ADP) Adaptive dynamic programming (ADP) interconnected systems interconnected systems disturbance rejection disturbance rejection decentralised tracking control (DTC) decentralised tracking control (DTC) neural networks neural networks

引用:

复制并粘贴一种已设定好的引用格式,或利用其中一个链接导入到文献管理软件中。

GB/T 7714 Wang, Ding , Fan, Wenqian , Li, Menghua et al. Decentralised tracking control based on critic learning for nonlinear disturbed interconnected systems [J]. | INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE , 2023 , 54 (5) : 1150-1164 .
MLA Wang, Ding et al. "Decentralised tracking control based on critic learning for nonlinear disturbed interconnected systems" . | INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE 54 . 5 (2023) : 1150-1164 .
APA Wang, Ding , Fan, Wenqian , Li, Menghua , Qiao, Junfei . Decentralised tracking control based on critic learning for nonlinear disturbed interconnected systems . | INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE , 2023 , 54 (5) , 1150-1164 .
导入链接 NoteExpress RIS BibTex
Discounted linear Q-learning control with novel tracking cost and its stability SCIE
期刊论文 | 2023 , 626 , 339-353 | INFORMATION SCIENCES
WoS核心集被引次数: 8
摘要&关键词 引用

摘要 :

In this article, in order to achieve optimal tracking control of unknown linear discrete sys-tems, a model-free scheme based on Q-learning is established online. First, we introduce an innovative performance index function, so as to eliminate the tracking error and avert the calculation for stable control policies of the reference trajectory. Taking value iteration and policy iteration into consideration, the corresponding model-based approaches are derived. Then, the Q-function is developed and the model-free algorithm utilizing Q-learning is given for the sake of dealing with the linear quadratic tracking (LQT) problem online with-out relying on system dynamics information. In addition, novel stability analysis based on Q-learning is provided for the discounted LQT control issue and the probing noise is demonstrated that it does not result in any excitation noise bias. Finally, by means of con-ducting numerical simulation, the proposed Q-learning algorithm is demonstrated to be effective and practicable.(c) 2023 Elsevier Inc. All rights reserved.

关键词 :

Model -free control Model -free control Discounted linear quadratic tracking Discounted linear quadratic tracking Adaptive critic Adaptive critic Q-function Q-function Reinforcement learning Reinforcement learning

引用:

复制并粘贴一种已设定好的引用格式,或利用其中一个链接导入到文献管理软件中。

GB/T 7714 Wang, Ding , Ren, Jin , Ha, Mingming . Discounted linear Q-learning control with novel tracking cost and its stability [J]. | INFORMATION SCIENCES , 2023 , 626 : 339-353 .
MLA Wang, Ding et al. "Discounted linear Q-learning control with novel tracking cost and its stability" . | INFORMATION SCIENCES 626 (2023) : 339-353 .
APA Wang, Ding , Ren, Jin , Ha, Mingming . Discounted linear Q-learning control with novel tracking cost and its stability . | INFORMATION SCIENCES , 2023 , 626 , 339-353 .
导入链接 NoteExpress RIS BibTex
Dynamic Transfer Reference Point-Oriented MOEA/D Involving Local Objective-Space Knowledge SCIE
期刊论文 | 2022 , 26 (3) , 542-554 | IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION
WoS核心集被引次数: 23
摘要&关键词 引用

摘要 :

The decomposition-based evolutionary algorithm (MOEA/D) has attained excellent performance in solving optimization problems involving multiple conflicting objectives. However, the Pareto-optimal front (POF) of many multiobjective optimization problems (MOPs) has irregular properties, which weakens the performance of MOEA/D. To address this issue, we devise a dynamic transfer reference point-oriented MOEA/D with local objective-space knowledge (DTR-MOEA/D). The design principle is based on three original and rigorous mechanisms. First, the individuals are projected onto a line segment (two-objective case) or a 3-D plane (three-objective case) after being normalized in the objective space. The line segment or the plane is divided into three different regions: 1) the central region; 2) the middle region; and 3) the edge region. Second, a dynamic transfer criterion of the reference point is developed based on the population density relationships in different regions. Third, a strategy of population diversity enhancement guided by local objective-space knowledge is adopted to improve the diversity of the population. Finally, the experimental results conducted on 16 benchmark MOPs and eight modified MOPs with irregular POF shapes verify that the proposed DTR-MOEA/D has attained a strong competitiveness compared with other representative algorithms.

关键词 :

Shape Shape Pareto optimization Pareto optimization multiobjective optimization multiobjective optimization Optimization Optimization Decomposition Decomposition Statistics Statistics local objective space local objective space dynamic transfer reference point dynamic transfer reference point Convergence Convergence Sociology Sociology Optical fibers Optical fibers

引用:

复制并粘贴一种已设定好的引用格式,或利用其中一个链接导入到文献管理软件中。

GB/T 7714 Xie, Yingbo , Yang, Shengxiang , Wang, Ding et al. Dynamic Transfer Reference Point-Oriented MOEA/D Involving Local Objective-Space Knowledge [J]. | IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION , 2022 , 26 (3) : 542-554 .
MLA Xie, Yingbo et al. "Dynamic Transfer Reference Point-Oriented MOEA/D Involving Local Objective-Space Knowledge" . | IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION 26 . 3 (2022) : 542-554 .
APA Xie, Yingbo , Yang, Shengxiang , Wang, Ding , Qiao, Junfei , Yin, Baocai . Dynamic Transfer Reference Point-Oriented MOEA/D Involving Local Objective-Space Knowledge . | IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION , 2022 , 26 (3) , 542-554 .
导入链接 NoteExpress RIS BibTex
Neural critic learning for tracking control design of constrained nonlinear multi-person zero-sum games SCIE
期刊论文 | 2022 , 512 , 456-465 | NEUROCOMPUTING
WoS核心集被引次数: 6
摘要&关键词 引用

摘要 :

In this paper, an adaptive critic method based on neural networks is established to solve the tracking con-trol problem for multi-person zero-sum games with constrained nonlinear dynamics. First, an augmented system is constructed with the tracking error system and the reference system, an appropriate function is introduced to handle the constrained problem, and a constrained tracking Hamilton-Jacobi-Isaacs (HJI) equation is derived for the augmented system. Then, a constrained tracking design with neural critic learning for multi-person zero-sum games is developed to approximately solve the tracking HJI equation with input constraints. A new updating rule is given and only one critic network is employed during neural critic learning. In addition, we prove that the tracking error in the augmented system is uniformly ulti-mately bounded by using Lyapunov's direct method. Finally, an example is given to verify the effectiveness of the proposed method. In this example, we make the number of control inputs less than the number of disturbance inputs. (C) 2022 Elsevier B.V. All rights reserved.

关键词 :

Multi -person zero -sum games Multi -person zero -sum games Adaptive dynamic programming Adaptive dynamic programming Neural critic learning Neural critic learning Constrained tracking control Constrained tracking control

引用:

复制并粘贴一种已设定好的引用格式,或利用其中一个链接导入到文献管理软件中。

GB/T 7714 Li, Menghua , Wang, Ding , Qiao, Junfei . Neural critic learning for tracking control design of constrained nonlinear multi-person zero-sum games [J]. | NEUROCOMPUTING , 2022 , 512 : 456-465 .
MLA Li, Menghua et al. "Neural critic learning for tracking control design of constrained nonlinear multi-person zero-sum games" . | NEUROCOMPUTING 512 (2022) : 456-465 .
APA Li, Menghua , Wang, Ding , Qiao, Junfei . Neural critic learning for tracking control design of constrained nonlinear multi-person zero-sum games . | NEUROCOMPUTING , 2022 , 512 , 456-465 .
导入链接 NoteExpress RIS BibTex
System Stability of Learning-Based Linear Optimal Control With General Discounted Value Iteration SCIE
期刊论文 | 2022 , 34 (9) , 6504-6514 | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
WoS核心集被引次数: 41
摘要&关键词 引用

摘要 :

For discounted optimal regulation design, the stability of the controlled system is affected by the discount factor. If an inappropriate discount factor is employed, the optimal control policy might be unstabilizing. Therefore, in this article, the effect of the discount factor on the stabilization of control strategies is discussed. We develop the system stability criterion and the selection rules of the discount factor with respect to the linear quadratic regulator problem under the general discounted value iteration algorithm. Based on the monotonicity of the value function sequence, the method to judge the stability of the controlled system is established during the iteration process. In addition, once some stability conditions are satisfied at a certain iteration step, all control policies after this iteration step are stabilizing. Furthermore, combined with the undiscounted optimal control problem, the practical rule of how to select an appropriate discount factor is constructed. Finally, several simulation examples with physical backgrounds are conducted to demonstrate the present theoretical results.

关键词 :

Regulators Regulators Stability criteria Stability criteria reinforcement learning (RL) reinforcement learning (RL) discount factor discount factor Costs Costs Adaptive critic design Adaptive critic design optimal control optimal control Asymptotic stability Asymptotic stability stability stability Heuristic algorithms Heuristic algorithms value iteration (VI) value iteration (VI) Optimal control Optimal control linear quadratic regulator (LQR) linear quadratic regulator (LQR) Cost function Cost function

引用:

复制并粘贴一种已设定好的引用格式,或利用其中一个链接导入到文献管理软件中。

GB/T 7714 Wang, Ding , Ren, Jin , Ha, Mingming et al. System Stability of Learning-Based Linear Optimal Control With General Discounted Value Iteration [J]. | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS , 2022 , 34 (9) : 6504-6514 .
MLA Wang, Ding et al. "System Stability of Learning-Based Linear Optimal Control With General Discounted Value Iteration" . | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 34 . 9 (2022) : 6504-6514 .
APA Wang, Ding , Ren, Jin , Ha, Mingming , Qiao, Junfei . System Stability of Learning-Based Linear Optimal Control With General Discounted Value Iteration . | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS , 2022 , 34 (9) , 6504-6514 .
导入链接 NoteExpress RIS BibTex
The intelligent critic framework for advanced optimal control SCIE
期刊论文 | 2022 , 55 (1) , 1-22 | ARTIFICIAL INTELLIGENCE REVIEW
WoS核心集被引次数: 124
摘要&关键词 引用

摘要 :

The idea of optimization can be regarded as an important basis of many disciplines and hence is extremely useful for a large number of research fields, particularly for artificial-intelligence-based advanced control design. Due to the difficulty of solving optimal control problems for general nonlinear systems, it is necessary to establish a kind of novel learning strategies with intelligent components. Besides, the rapid development of computer and networked techniques promotes the research on optimal control within discrete-time domain. In this paper, the bases, the derivation, and recent progresses of critic intelligence for discrete-time advanced optimal control design are presented with an emphasis on the iterative framework. Among them, the so-called critic intelligence methodology is highlighted, which integrates learning approximators and the reinforcement formulation.

关键词 :

Advanced optimal control Advanced optimal control Intelligent critic Intelligent critic Dynamic systems Dynamic systems

引用:

复制并粘贴一种已设定好的引用格式,或利用其中一个链接导入到文献管理软件中。

GB/T 7714 Wang, Ding , Ha, Mingming , Zhao, Mingming . The intelligent critic framework for advanced optimal control [J]. | ARTIFICIAL INTELLIGENCE REVIEW , 2022 , 55 (1) : 1-22 .
MLA Wang, Ding et al. "The intelligent critic framework for advanced optimal control" . | ARTIFICIAL INTELLIGENCE REVIEW 55 . 1 (2022) : 1-22 .
APA Wang, Ding , Ha, Mingming , Zhao, Mingming . The intelligent critic framework for advanced optimal control . | ARTIFICIAL INTELLIGENCE REVIEW , 2022 , 55 (1) , 1-22 .
导入链接 NoteExpress RIS BibTex
A Novel Value Iteration Scheme With Adjustable Convergence Rate SCIE
期刊论文 | 2022 , 34 (10) , 7430-7442 | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
WoS核心集被引次数: 31
摘要&关键词 引用

摘要 :

In this article, a novel value iteration scheme is developed with convergence and stability discussions. A relaxation factor is introduced to adjust the convergence rate of the value function sequence. The convergence conditions with respect to the relaxation factor are given. The stability of the closed-loop system using the control policies generated by the present VI algorithm is investigated. Moreover, an integrated VI approach is developed to accelerate and guarantee the convergence by combining the advantages of the present and traditional value iterations. Also, a relaxation function is designed to adaptively make the developed value iteration scheme possess fast convergence property. Finally, the theoretical results and the effectiveness of the present algorithm are validated by numerical examples.

关键词 :

Numerical stability Numerical stability reinforcement learning (RL) reinforcement learning (RL) Stability criteria Stability criteria Adaptive dynamic programming (ADP) Adaptive dynamic programming (ADP) discrete-time nonlinear systems discrete-time nonlinear systems value iteration value iteration convergence rate convergence rate Heuristic algorithms Heuristic algorithms Approximation algorithms Approximation algorithms Optimal control Optimal control Convergence Convergence admissible control policy admissible control policy Iterative algorithms Iterative algorithms

引用:

复制并粘贴一种已设定好的引用格式,或利用其中一个链接导入到文献管理软件中。

GB/T 7714 Ha, Mingming , Wang, Ding , Liu, Derong . A Novel Value Iteration Scheme With Adjustable Convergence Rate [J]. | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS , 2022 , 34 (10) : 7430-7442 .
MLA Ha, Mingming et al. "A Novel Value Iteration Scheme With Adjustable Convergence Rate" . | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 34 . 10 (2022) : 7430-7442 .
APA Ha, Mingming , Wang, Ding , Liu, Derong . A Novel Value Iteration Scheme With Adjustable Convergence Rate . | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS , 2022 , 34 (10) , 7430-7442 .
导入链接 NoteExpress RIS BibTex
An event-triggered neural critic technique for nonzero-sum game design with control constraints SCIE
期刊论文 | 2022 , 54 (2) , 237-250 | INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE
WoS核心集被引次数: 1
摘要&关键词 引用

摘要 :

In this paper, an event-triggered neural critic learning algorithm is investigated to address constrained nonzero-sum game problems with discrete-time nonaffine dynamics. First, in order to ensure the saturation independence of two controllers in the nonzero-sum game problem, we adopt two different boundaries to constrain them respectively. Then, a novel triggering condition is designed to reduce the update times of the controllers, which achieves the purpose of less calculation. It is emphasised that the triggering condition is established based on the iteration of the time-triggered mechanism. Meanwhile, we prove that the real cost function possesses a predetermined upper bound, which realises the cost guarantee of the controlled system. In addition, we prove that the closed-loop system using the developed algorithm is asymptotically stable and that the system state and the sampling state are uniformly ultimately bounded during the process of training neural networks. Finally, two simulation examples are conducted to demonstrate the effectiveness of the proposed algorithm.

关键词 :

Adaptive critic technique Adaptive critic technique optimal control optimal control neural networks neural networks nonaffine systems nonaffine systems event-triggered control event-triggered control constrained control constrained control nonzero-sum games nonzero-sum games

引用:

复制并粘贴一种已设定好的引用格式,或利用其中一个链接导入到文献管理软件中。

GB/T 7714 Hu, Lingzhi , Wang, Ding , Ren, Jin et al. An event-triggered neural critic technique for nonzero-sum game design with control constraints [J]. | INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE , 2022 , 54 (2) : 237-250 .
MLA Hu, Lingzhi et al. "An event-triggered neural critic technique for nonzero-sum game design with control constraints" . | INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE 54 . 2 (2022) : 237-250 .
APA Hu, Lingzhi , Wang, Ding , Ren, Jin , Wang, Jiangyu , Qiao, Junfei . An event-triggered neural critic technique for nonzero-sum game design with control constraints . | INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE , 2022 , 54 (2) , 237-250 .
导入链接 NoteExpress RIS BibTex
Multi-event-triggered adaptive critic control with guaranteed cost for discrete-time nonlinear nonzero-sum games SCIE
期刊论文 | 2022 , 32 (18) , 10292-10308 | INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL
WoS核心集被引次数: 3
摘要&关键词 引用

摘要 :

In this article, a new event-based adaptive critic algorithm with multiple triggering conditions is investigated to address multi-player nonzero-sum game problems for discrete-time nonlinear dynamics. In order to improve resource utilization while ensure mutual independence among players, the corresponding novel triggering conditions are designed for each player. The corresponding control input is updated only when the relevant triggering condition is violated. It is emphasized that these triggering conditions are established based on the iteration of the time-triggered mechanism. Then, according to the setting triggering conditions, we prove that the real cost function possesses a predetermined upper bound, which realizes the cost guarantee of the controlled system. Additionally, the multi-player closed-loop system is proved to be asymptotically stable and the multi-event-triggered control method is implemented by constructing three kinds of neural networks. Finally, the effectiveness of the developed multi-event-triggered control approach is verified through conducting two simulation examples.

关键词 :

neural networks neural networks adaptive critic adaptive critic multi-player games multi-player games guaranteed cost guaranteed cost multi-event-triggered control multi-event-triggered control nonlinear control nonlinear control

引用:

复制并粘贴一种已设定好的引用格式,或利用其中一个链接导入到文献管理软件中。

GB/T 7714 Wang, Ding , Hu, Lingzhi , Qiao, Junfei . Multi-event-triggered adaptive critic control with guaranteed cost for discrete-time nonlinear nonzero-sum games [J]. | INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL , 2022 , 32 (18) : 10292-10308 .
MLA Wang, Ding et al. "Multi-event-triggered adaptive critic control with guaranteed cost for discrete-time nonlinear nonzero-sum games" . | INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL 32 . 18 (2022) : 10292-10308 .
APA Wang, Ding , Hu, Lingzhi , Qiao, Junfei . Multi-event-triggered adaptive critic control with guaranteed cost for discrete-time nonlinear nonzero-sum games . | INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL , 2022 , 32 (18) , 10292-10308 .
导入链接 NoteExpress RIS BibTex
Dual Event-Triggered Constrained Control Through Adaptive Critic for Discrete-Time Zero-Sum Games SCIE
期刊论文 | 2022 , 53 (3) , 1584-1595 | IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
WoS核心集被引次数: 75
摘要&关键词 引用

摘要 :

In this article, through adaptive critic, a dual event-triggered (DET) constrained control scheme is established for discrete-time nonlinear zero-sum games. The neural networks are trained from the dual heuristic dynamic programming technique to obtain the approximate optimal policy pair. Two corresponding independent triggering conditions are constructed for the control input and the disturbance to improve the utilization efficiency and ensure the independence between them. In addition, in order to overcome the challenge caused by the actuator saturation, we constrain the control input to a bounded range. Meanwhile, the asymptotically stability is proved for the DET control system. Finally, experimental simulations are conducted to verify the effectiveness of the proposed algorithm.

关键词 :

Control systems Control systems Neural networks Neural networks optimal control optimal control Stability analysis Stability analysis Iterative methods Iterative methods Games Games dual event-triggered (DET) control dual event-triggered (DET) control neural networks neural networks zero-sum games (ZSGs) zero-sum games (ZSGs) iterative adaptive critic iterative adaptive critic Adaptive systems Adaptive systems Numerical stability Numerical stability Discrete-time nonlinear systems Discrete-time nonlinear systems

引用:

复制并粘贴一种已设定好的引用格式,或利用其中一个链接导入到文献管理软件中。

GB/T 7714 Wang, Ding , Hu, Lingzhi , Zhao, Mingming et al. Dual Event-Triggered Constrained Control Through Adaptive Critic for Discrete-Time Zero-Sum Games [J]. | IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS , 2022 , 53 (3) : 1584-1595 .
MLA Wang, Ding et al. "Dual Event-Triggered Constrained Control Through Adaptive Critic for Discrete-Time Zero-Sum Games" . | IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 53 . 3 (2022) : 1584-1595 .
APA Wang, Ding , Hu, Lingzhi , Zhao, Mingming , Qiao, Junfei . Dual Event-Triggered Constrained Control Through Adaptive Critic for Discrete-Time Zero-Sum Games . | IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS , 2022 , 53 (3) , 1584-1595 .
导入链接 NoteExpress RIS BibTex
每页显示 10| 20| 50 条结果
< 页,共 4 >

导出

数据:

选中

格式:
在线人数/总访问数:90/4642598
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司