
Featured Article

Optimal Synchronization Control of Heterogeneous Asymmetric Input-Constrained Unknown Nonlinear MASs via Reinforcement Learning

Abstract:

This paper considers the optimal synchronization problem for heterogeneous, unknown nonlinear multiagent systems (MASs) subject to asymmetric input constraints. First, a state-space transformation is performed such that satisfying symmetric input constraints for the transformed system guarantees that the asymmetric input constraints of the original system are satisfied. Then, because the leader's information is not available to every follower, a novel distributed observer is designed to estimate the leader's state using only information exchanged among neighboring followers. Next, a network of augmented systems is constructed by combining the observer and follower dynamics. A nonquadratic cost function is assigned to each augmented system (agent) so that its optimization satisfies the input constraints, and the corresponding constrained Hamilton-Jacobi-Bellman (HJB) equation is solved in a data-based fashion. More specifically, a data-based off-policy reinforcement learning (RL) algorithm is presented that learns the solution to the constrained HJB equation without requiring complete knowledge of the agents' dynamics. Convergence of the improved RL algorithm to the solution of the constrained HJB equation is also demonstrated. Finally, a simulation example demonstrates the correctness and validity of the theoretical results.
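The idea of reducing an asymmetric input constraint to a symmetric one can be illustrated with a minimal sketch. This is not the paper's state-space transformation, which the abstract does not spell out; it is a simpler input-offset construction, assumed here purely for illustration. The function names `symmetric_bound` and `saturated_input` are hypothetical, and the tanh saturation is a common smooth choice in constrained-input control rather than something the abstract states.

```python
import math


def symmetric_bound(u_min, u_max):
    """Split [u_min, u_max] into a center and a half-width.

    Any u in [u_min, u_max] can be written as u = center + v
    with |v| <= half_width, i.e. a symmetric constraint on v.
    """
    center = 0.5 * (u_max + u_min)
    half_width = 0.5 * (u_max - u_min)
    return center, half_width


def saturated_input(raw, u_min, u_max):
    """Map an unconstrained signal into [u_min, u_max] (illustrative only).

    The shifted variable v is bounded symmetrically by a tanh, then the
    offset is added back to recover the asymmetric bound.
    """
    center, half_width = symmetric_bound(u_min, u_max)
    v = half_width * math.tanh(raw / half_width)  # |v| <= half_width
    return center + v
```

For example, with bounds `u_min = -1.0` and `u_max = 3.0`, the shifted variable is constrained to `[-2.0, 2.0]` around the center `1.0`, so any value returned by `saturated_input` stays inside the original asymmetric interval.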

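Keywords:

Chinese Library Classification (CLC) numbers:

[1] Medicine and Health (R) / Pharmacy (R9) / Pharmacology (R96) / Experimental Pharmacology (R965)

[2] Automation Technology, Computer Technology (TP) / Computing Technology, Computer Technology (TP3) / Computer Applications (TP39) / Information Processing (TP391)

[3] Medicine and Health (R) / Basic Medicine (R3) / Pathology (R36) / Pathological Processes (R364)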

Authors:

Lina Xia; Qing Li; Ruizhuo Song; Hamidreza Modares

Author affiliations:

Beijing Engineering Research Center of Industrial Spectrum Imaging, School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing 100083, China; Department of Mechanical Engineering, Michigan State University, East Lansing, MI 48824 USA

Source:

IEEE/CAA Journal of Automatica Sinica (自动化学报（英文版）)

Citation format:

[1] Lina Xia; Qing Li; Ruizhuo Song; Hamidreza Modares. Optimal Synchronization Control of Heterogeneous Asymmetric Input-Constrained Unknown Nonlinear MASs via Reinforcement Learning [J]. IEEE/CAA Journal of Automatica Sinica (自动化学报（英文版）), 2022(03): 520-532

Similar Articles

Performance Analysis of Sparse Array based Massive MIMO via Joint Convex Optimization

Mengting Lou; Jing Jin; Hanning Wang; Dan Wu; Liang Xia; Qixing Wang; Yifei Yuan; Jiangzhou Wang. Future Research Lab, China Mobile Research Institute, Beijing 100053, China; School of Engineering and Digital Arts, University of Kent, Canterbury, U.K.

Joint Access Point Selection and Resource Allocation in MEC-Assisted Network: A Reinforcement Learning Based Approach

Zexu Li; Chunjing Hu; Wenbo Wang; Yong Li; Guiming Wei. Key Laboratory of Universal Wireless Communications, Beijing University of Posts and Telecommunications, Beijing 100876, China; China Academy of Information and Communications Technology, Beijing 100191, China

MEC Enabled Cooperative Sensing and Resource Allocation for Industrial IoT Systems

Yanpeng Dai; Lihong Zhao; Ling Lyu. School of Information Science and Technology, Dalian Maritime University, Dalian 116086, China; State Key Laboratory of Integrated Services Networks, Xidian University, Shaanxi 710071, China

Efficient Multi-User for Task Offloading and Server Allocation in Mobile Edge Computing Systems

Qiuming Liu; Jing Li; Jianming Wei; Ruoxuan Zhou; Zheng Chai; Shumin Liu. School of Software Engineering, Jiangxi University of Science and Technology, Nanchang 330013, China; Nanchang Key Laboratory of Virtual Digital Factory and Cultural Communications, Nanchang 330013, China

A Multi-Agent Reinforcement Learning-Based Collaborative Jamming System: Algorithm Design and Software-Defined Radio Implementation