咨询与建议

限定检索结果

文献类型

  • 6 篇 期刊文献
  • 4 篇 会议

馆藏范围

  • 10 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 10 篇 工学
    • 7 篇 计算机科学与技术...
    • 5 篇 仪器科学与技术
    • 5 篇 控制科学与工程
    • 5 篇 软件工程
    • 4 篇 机械工程
    • 3 篇 信息与通信工程
    • 2 篇 电气工程
    • 2 篇 电子科学与技术(可...
    • 1 篇 动力工程及工程热...
    • 1 篇 网络空间安全
  • 6 篇 管理学
    • 6 篇 管理科学与工程(可...
    • 1 篇 图书情报与档案管...
  • 3 篇 理学
    • 2 篇 数学
    • 1 篇 系统科学
  • 1 篇 文学
    • 1 篇 外国语言文学

主题

  • 10 篇 reward function
  • 1 篇 reinforcement le...
  • 1 篇 deep q-learning(...
  • 1 篇 six-dof arm robo...
  • 1 篇 electric vehicle...
  • 1 篇 scada data
  • 1 篇 robotic pushing
  • 1 篇 markov decision ...
  • 1 篇 multi-agents dee...
  • 1 篇 optimization ope...
  • 1 篇 learning methods
  • 1 篇 robotic grasping
  • 1 篇 artificial intel...
  • 1 篇 genetic algorith...
  • 1 篇 convolutional ne...
  • 1 篇 deep reinforceme...
  • 1 篇 heterogeneous wi...
  • 1 篇 ddpg
  • 1 篇 dnn
  • 1 篇 wind power

机构

  • 1 篇 state key labora...
  • 1 篇 special environm...
  • 1 篇 school of electr...
  • 1 篇 school of automa...
  • 1 篇 r&d state grid i...
  • 1 篇 school of contro...
  • 1 篇 college of autom...
  • 1 篇 school of aerona...
  • 1 篇 school of electr...
  • 1 篇 university of ji...
  • 1 篇 beijing institut...
  • 1 篇 school of electr...
  • 1 篇 school of comput...

作者

  • 1 篇 zesan liu
  • 1 篇 shao zhifei
  • 1 篇 dong peng
  • 1 篇 dong yubo
  • 1 篇 jing zhang
  • 1 篇 yuanchen jiang
  • 1 篇 zhou yufan
  • 1 篇 junjun zhang
  • 1 篇 zhihao ni
  • 1 篇 wen tan
  • 1 篇 jianwei zhang
  • 1 篇 jiaohui xu
  • 1 篇 yuxiang yang
  • 1 篇 manlu liu
  • 1 篇 xinmao li
  • 1 篇 haitao ding
  • 1 篇 jin cheng
  • 1 篇 fei xia
  • 1 篇 jianli xie
  • 1 篇 cui tao

语言

  • 10 篇 英文
检索条件"主题词=Reward function"
10 条 记 录,以下是1-10 订阅
排序:
reward function Design Method for Long Episode Pursuit Tasks Under Polar Coordinate in Multi-Agent Reinforcement Learning
收藏 引用
Journal of Shanghai Jiaotong university(Science) 2024年 第4期29卷 646-655页
作者: DONG Yubo CUI Tao ZHOU Yufan SONG Xun ZHU Yue DONG Peng School of Aeronautics and Astronautics Shanghai Jiao Tong UniversityShanghai200240China Beijing Institute of Electronic System Engineering Beijing100854China
Multi-agent reinforcement learning has recently been applied to solve pursuit ***,it suffers from a large number of time steps per training episode,thus always struggling to converge effectively,resulting in low rewar... 详细信息
来源: 维普期刊数据库 维普期刊数据库 评论
Collaborative Pushing and Grasping of Tightly Stacked Objects via Deep Reinforcement Learning
收藏 引用
IEEE/CAA Journal of Automatica Sinica 2022年 第1期9卷 135-145页
作者: Yuxiang Yang Zhihao Ni Mingyu Gao Jing Zhang Dacheng Tao School of Electronics and Information Hangzhou Dianzi UniversityHangzhouand also with Zhejiang Provincial Key Laboratory of Equipment ElectronicsHangzhou 310018China School of Computer Science Faculty of EngineeringUniversity of SydneyDarlingtonNSW 2006Australia JD Explore Academy ***Beijing 101111China
Directly grasping the tightly stacked objects may cause collisions and result in failures,degenerating the functionality of robotic *** by the observation that first pushing objects to a state of mutual separation and... 详细信息
来源: 维普期刊数据库 维普期刊数据库 同方期刊数据库 同方期刊数据库 评论
Detecting Icing on the Blades of a Wind Turbine Using a Deep Neural Network
收藏 引用
Computer Modeling in Engineering & Sciences 2023年 第2期134卷 767-782页
作者: Tingshun Li Jiaohui Xu Zesan Liu Dadi Wang Wen Tan School of Control and Computer Engineering North China Electric Power UniversityBeijing102206China R&D State Grid Information&Telecommunication Group Co.Beijing102211China
The blades of wind turbines located at high latitudes are often covered with ice in late autumn and winter,where this affects their capacity for power generation as well as their *** identifying the icing of the blade... 详细信息
来源: 维普期刊数据库 维普期刊数据库 评论
An enhanced eco-driving strategy based on reinforcement learning for connected electric vehicles:cooperative velocity and lane-changing control
收藏 引用
Journal of Intelligent and Connected Vehicles 2022年 第3期5卷 316-332页
作者: Haitao Ding Wei Li Nan Xu Jianwei Zhang State Key Laboratory of Automotive Simulation and Control Jilin UniversityChangchunChina
Purpose–This study aims to propose an enhanced eco-driving strategy based on reinforcement learning(RL)to alleviate the mileage anxiety of electric vehicles(EVs)in the connected ***/methodology/approach–In this pape... 详细信息
来源: 维普期刊数据库 维普期刊数据库 评论
Heterogeneous Network Selection Optimization Algorithm Based on a Markov Decision Model
收藏 引用
China Communications 2020年 第2期17卷 40-53页
作者: Jianli Xie Wenjuan Gao Cuiran Li School of Electronic and Information Engineering Lanzhou Jiaotong University
A network selection optimization algorithm based on the Markov decision process(MDP)is proposed so that mobile terminals can always connect to the best wireless network in a heterogeneous network *** the different typ... 详细信息
来源: 维普期刊数据库 维普期刊数据库 同方期刊数据库 同方期刊数据库 评论
A survey of inverse reinforcement learning techniques
收藏 引用
International Journal of Intelligent Computing and Cybernetics 2012年 第3期5卷 293-311页
作者: Shao Zhifei Er Meng Joo School of Electrical and Electronics Engineering Nanyang Technological UniversitySingapore
Purpose-This purpose of this paper is to provide an overview of the theoretical background and applications of inverse reinforcement learning(IRL).Design/methodology/approach-Reinforcement learning(RL)techniques provi... 详细信息
来源: 维普期刊数据库 维普期刊数据库 评论
Optimization of the Ice Storage Air Conditioning System Operation Based on Deep Reinforcement Learning
Optimization of the Ice Storage Air Conditioning System Oper...
收藏 引用
第40届中国控制会议
作者: Mingte Li Fei Xia Lin Xia College of Automation Engineering Shanghai University of Electric Power
With the intention of obtaining the room temperature and economic cost control strategy of an ice storage air conditioning system in a small office building in Shanghai,the ice storage air conditioning system is estab... 详细信息
来源: cnki会议 评论
Manipulator Control Method Based on Deep Reinforcement Learning
Manipulator Control Method Based on Deep Reinforcement Learn...
收藏 引用
第32届中国控制与决策会议
作者: Rui Zeng Manlu Liu Junjun Zhang Xinmao Li Qijie Zhou Yuanchen Jiang Special Environment Robot Technology Key Laboratory of Sichuan Province Southwest University of Science and Technology
Robotic arm have transformed the manufacturing industry and have been used for scientific exploration in human inaccessible environments. The existing manipulator control methods based on deep reinforcement learning u... 详细信息
来源: cnki会议 评论
Reinforcement Learning Control for Robot Arm Grasping Based on Improved DDPG
Reinforcement Learning Control for Robot Arm Grasping Based ...
收藏 引用
第40届中国控制会议
作者: Guangjun Qi Yuan Li School of Automation Beijing Institute of Technology
Although the traditional robot arm grasping control has high control accuracy,its price is based on high-precision hardware and lacks *** order to achieve high control accuracy and flexibility on a relatively inexpens... 详细信息
来源: cnki会议 评论
Deep Reinforcement Learning Approach for Flocking Control of Multi-agents
Deep Reinforcement Learning Approach for Flocking Control of...
收藏 引用
第40届中国控制会议
作者: Han Zhang Jin Cheng University of Jinan
Flocking behaviors learning with multi-agents deep deterministic policy gradient algorithm is addressed in this paper. Different from the non-intelligent algorithm, agents constantly update strategies by learning the ... 详细信息
来源: cnki会议 评论