咨询与建议

限定检索结果

文献类型

  • 31 篇 期刊文献
  • 5 篇 会议

馆藏范围

  • 36 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 34 篇 工学
    • 23 篇 计算机科学与技术...
    • 21 篇 控制科学与工程
    • 20 篇 软件工程
    • 7 篇 仪器科学与技术
    • 6 篇 电子科学与技术(可...
    • 5 篇 机械工程
    • 5 篇 信息与通信工程
    • 2 篇 电气工程
    • 2 篇 航空宇航科学与技...
    • 1 篇 交通运输工程
    • 1 篇 兵器科学与技术
    • 1 篇 环境科学与工程(可...
    • 1 篇 公安技术
  • 20 篇 管理学
    • 20 篇 管理科学与工程(可...
    • 1 篇 工商管理
  • 2 篇 理学
    • 2 篇 数学
  • 2 篇 艺术学
    • 2 篇 设计学(可授艺术学...
  • 1 篇 经济学
    • 1 篇 应用经济学
  • 1 篇 教育学
    • 1 篇 心理学(可授教育学...
  • 1 篇 军事学
    • 1 篇 战术学
    • 1 篇 军队指挥学

主题

  • 36 篇 multi-agent rein...
  • 2 篇 reinforcement le...
  • 2 篇 trajectory plann...
  • 2 篇 deep reinforceme...
  • 1 篇 software-defined...
  • 1 篇 internet of thin...
  • 1 篇 covid-19
  • 1 篇 vehicle dynamics
  • 1 篇 task migration
  • 1 篇 molecule design
  • 1 篇 cognitive consis...
  • 1 篇 multi-agent coop...
  • 1 篇 service function...
  • 1 篇 network function...
  • 1 篇 distributed robu...
  • 1 篇 confidence learn...
  • 1 篇 multi-view learn...
  • 1 篇 large-scale coll...
  • 1 篇 markov game
  • 1 篇 task relationshi...

机构

  • 2 篇 national key lab...
  • 2 篇 polixir technolo...
  • 1 篇 key laboratory o...
  • 1 篇 university of sc...
  • 1 篇 school of electr...
  • 1 篇 college of softw...
  • 1 篇 college of compu...
  • 1 篇 ucrd chandigarh ...
  • 1 篇 state key labora...
  • 1 篇 school of softwa...
  • 1 篇 department of au...
  • 1 篇 department of bu...
  • 1 篇 chandigarh colle...
  • 1 篇 symbiosis centre...
  • 1 篇 peng cheng labor...
  • 1 篇 department of co...
  • 1 篇 school of commun...
  • 1 篇 school of mathem...
  • 1 篇 software enginee...
  • 1 篇 suzhou joint gra...

作者

  • 2 篇 lei yuan
  • 2 篇 feng chen
  • 2 篇 zongzhang zhang
  • 2 篇 yang yu
  • 1 篇 wei hu
  • 1 篇 jiadong yu
  • 1 篇 ya zhang
  • 1 篇 sudhakar kumar
  • 1 篇 xiaotie deng
  • 1 篇 bin hu
  • 1 篇 chang cyoon lim
  • 1 篇 ronghao zheng
  • 1 篇 dong peng
  • 1 篇 xinwei yuan
  • 1 篇 dong yubo
  • 1 篇 jun wang
  • 1 篇 hang xiao
  • 1 篇 dengpeng xing
  • 1 篇 yang shen
  • 1 篇 haibin cai

语言

  • 34 篇 英文
  • 2 篇 中文
检索条件"主题词=multi-agent reinforcement learning"
36 条 记 录,以下是21-30 订阅
Exploring Local Chemical Space in De Novo Molecular Generation Using multi-agent Deep reinforcement learning
收藏 引用
Natural Science 2021年 第9期13卷 412-424页
作者: Wei Hu Department of Computer Science Houghton College Houghton NY USA
Single-agent reinforcement learning (RL) is commonly used to learn how to play computer games, in which the agent makes one move before making the next in a sequential decision process. Recently single agent was also ... 详细信息
来源: 维普期刊数据库 维普期刊数据库 评论
multi-agent Dynamic Area Coverage Based on reinforcement learning with Connected agents
收藏 引用
Computer Systems Science & Engineering 2023年 第4期45卷 215-230页
作者: Fatih Aydemir Aydin Cetin STM Defence Technologies Engineering and Trade.Inc. Ankara06560Turkey Department of Computer Engineering Faculty of TechnologyGazi UniversityAnkara06500Turkey
Dynamic area coverage with small unmanned aerial vehicle(UAV)systems is one of the major research topics due to limited payloads and the difficulty of decentralized decision-making *** behavior of a group of UAVs in a... 详细信息
来源: 维普期刊数据库 维普期刊数据库 评论
A Weighted Mean Field reinforcement learning Algorithm for Large-Scale multi-agent Collaboration
收藏 引用
Guidance, Navigation and Control 2023年 第2期3卷 38-56页
作者: Xinwei Yuan He Wang Wenwu Yu Suzhou Joint Graduate School Southeast University SuzhouJiangsu Province 215123P.R.China School of Mathematics Southeast University NanjingJiangsu Province 210096P.R.China
reinforcement learning has been proven to be an effective approach for solving multi-agent coordination problems in a dynamic open *** dealing with multi-agent cooper-ation issues,the mean field multi-agent reinforcem... 详细信息
来源: 维普期刊数据库 维普期刊数据库 同方期刊数据库 同方期刊数据库 评论
multi-agent Hierarchical Graph Attention reinforcement learning for Grid-Aware Energy Management
收藏 引用
ZTE Communications 2023年 第3期21卷 11-21页
作者: FENG Bingyi FENG Mingxiao WANG Minrui ZHOU Wengang LI Houqiang University of Science and Technology of China Hefei 230026China
The increasing adoption of renewable energy has posed challenges for voltage regulation in power distribution *** energy management,which includes the control of smart inverters and energy management systems,is a tren... 详细信息
来源: 维普期刊数据库 维普期刊数据库 同方期刊数据库 同方期刊数据库 评论
Underwater multi-agent Cooperative Formation Hunting Based on Deep reinforcement learning
Underwater Multi-agent Cooperative Formation Hunting Based o...
收藏 引用
第43届中国控制会议
作者: Xiaobo Shi Meiqin Liu Shanling Dong Ronghao Zheng Ping Wei College of Electrical Engineering Zhejiang University
In addressing the issue of formation hunting and trajectory planning for multi-autonomous underwater vehicles(AUVs) in complex underwater environments, traditional virtual structure algorithms, and leader-follower mod... 详细信息
来源: cnki会议 评论
multi-agent Robust Time Differential reinforcement learning over communicated networks
Multi-agent Robust Time Differential Reinforcement Learning ...
收藏 引用
第37届中国控制会议
作者: Jiahong Li Nan Ma Xiangmin Han School of Robotics Beijing Union University
Recently, the researches on multi-agent reinforcement learning(MARL) have attracted tremendous interest in many applications, especially for autonomous driving. The main problem of MARL is how to deal with the uncer... 详细信息
来源: cnki会议 评论
Policy evaluation for reinforcement learning over asynchronous multi-agent networks
Policy evaluation for reinforcement learning over asynchrono...
收藏 引用
第40届中国控制会议
作者: Xingyu Sha Jiaqi Zhang Keyou You Department of Automation and BNRist Tsinghua University
This paper proposes a fully asynchronous algorithm for policy evaluation of multi-agent reinforcement learning over networks. Without any form of coordination, agents can communicate with neighbors and compute their l... 详细信息
来源: cnki会议 评论
An Overview of Intelligent Wireless Communications Using Deep reinforcement learning
收藏 引用
Journal of Communications and Information Networks 2019年 第2期4卷 15-29页
作者: Yongming Huang Chunmei Xu Cheng Zhang Meng Hua Zhengming Zhang School of Information Science and Engineering Southeast UniversityNanjing 210096China Purple Mountain Laboratories Nanjing 211111China
Future wireless communication networks tend to be intelligentized to accomplish the missions that cannot be *** the new intelligent communication systems,optimizing the network perfor-mance has become a challenge due ... 详细信息
来源: 维普期刊数据库 维普期刊数据库 评论
Intelligent logistics system of steel bar warehouse based on ubiquitous information
收藏 引用
International Journal of Minerals,Metallurgy and Materials 2021年 第8期28卷 1367-1377页
作者: Hai-nan He Xiao-chen Wang Gong-zhuang Peng Dong Xu Yang Liu Min Jiang Ze-dong Wu Da Zhang He Yan National Engineering Research Center for Advanced Rolling Technology University of Science and Technology BeijingBeijing 100083China
Internet of Things and artificial intelligence technology are the key elements of the intelligent construction of iron and steel production warehouse. This paper puts forward a whole set of intelligent scheme for bar ... 详细信息
来源: 维普期刊数据库 维普期刊数据库 同方期刊数据库 同方期刊数据库 评论
On the complexity of computing Markov perfect equilibrium in general-sum stochastic games
收藏 引用
National Science Review 2023年 第1期10卷 288-301页
作者: Xiaotie Deng Ningyuan Li David Mguni Jun Wang Yaodong Yang Center on Frontiers of Computing Studies School of Computer Science Peking University Center for multi-agent Research Institute for AI Peking University Huawei UK Computer Science University College London
Similar to the role of Markov decision processes in reinforcement learning,Markov games(also called stochastic games) lay down the foundation for the study of multi-agent reinforcement learning and se quential agent *... 详细信息
来源: 同方期刊数据库 同方期刊数据库 评论