Refine search results

Document type

  • Journal articles (2)

Collection scope

  • Electronic documents (2)
  • Print holdings (0)

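Subject classification

  • Science (1)
    • Mathematics (1)
  • Engineering (1)
    • Control Science and Engineering (1)
    • Computer Science and Technology... (1)
    • Software Engineering (1)
  • Management (1)
    • Management Science and Engineering (may... (1)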

Subject

  • value function a... (2)
  • policy evaluatio... (1)
  • temporal differe... (1)
  • q-learning (1)
  • air combat simul... (1)
  • deep reinforceme... (1)
  • fuzzy art (1)

Institution

  • school of automa... (1)
  • state key labora... (1)
  • college of intel... (1)

Author

  • Xiao Song (1)
  • Xin Xu (1)
  • Yaofei Ma (1)
  • Qiang Fang (1)
  • Yujun Zeng (1)
  • Guanghong Gong (1)
  • Junkai Ren (1)
  • Yichuan Zhang (1)
  • Yixing Lan (1)
  • Yanan Zhou (1)

Language

  • English (2)
Search criteria: "Subject term = value function approximation"
2 records; showing 1-10
Deep reinforcement learning using least-squares truncated temporal-difference
CAAI Transactions on Intelligence Technology, 2024, Vol. 9, No. 2, pp. 425-439
Authors: Junkai Ren, Yixing Lan, Xin Xu, Yichuan Zhang, Qiang Fang, Yujun Zeng. Affiliations: College of Intelligence Science and Technology, National University of Defense Technology, Changsha, China; State Key Laboratory of Astronautic Dynamics, Xi'an Satellite Control Center, Xi'an, China
Policy evaluation (PE) is a critical sub-problem in reinforcement learning, which estimates the value function for a given policy and can be used for policy ***, there still exist some limitations in current PE methods, su...
Source: VIP Journal Database
Hierarchical fuzzy ART for Q-learning and its application in air combat simulation
International Journal of Modeling, Simulation, and Scientific Computing, 2017, Vol. 8, No. 4, pp. 205-223
Authors: Yanan Zhou, Yaofei Ma, Xiao Song, Guanghong Gong. Affiliation: School of Automation Science and Electrical Engineering, Beihang University, XueYuan Road No. 37, HaiDian District, Beijing 100191, P.R. China
Value function approximation plays an important role in reinforcement learning (RL) with continuous state space, which is widely used to build decision models in *** traditional approaches require experienced designers t...
Source: VIP Journal Database