Abstract:
This paper introduces a collection of value-based quantum reinforcement learning algorithms and explores their parameters. The algorithms use Grover's algorithm to update the policy, which is stored as a superposition of qubits associated with each possible action. They fall into two classes: one that uses state value functions, V(s), and a new class that uses action value functions, Q(s,a). The new Q(s,a)-based quantum algorithms are found to converge faster than the V(s)-based algorithms, and in general the quantum algorithms are found to converge in fewer iterations than their classical counterparts, netting larger returns during training. The faster convergence of the Q(s,a)-based algorithms is attributed to the fact that they are more precise than those based on V(s), so updates are incorporated into the value function more efficiently. This effect is enhanced by the observation that the Q(s,a)-based algorithms may be trained with higher learning rates. The algorithms are then extended by adding multiple value functions, which are observed to allow larger learning rates and to improve convergence in environments with stochastic rewards; the latter is further improved by the probabilistic nature of the quantum algorithms. Finally, the quantum algorithms were found to use less CPU time than their classical counterparts overall, meaning that their benefits may be realized even without a full quantum computer.
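The abstract does not spell out the update rule, so the following is only a minimal classical sketch of the general idea it names: a policy held as amplitudes over actions, with a textbook Grover iteration (phase flip on one action followed by inversion about the mean) raising that action's measurement probability. The function names `uniform_policy`, `grover_iteration`, and `action_probabilities` are hypothetical, and how many iterations the paper's algorithms apply per update is not stated here.

```python
import math


def uniform_policy(n_actions):
    """Equal-amplitude superposition over n_actions (real amplitudes)."""
    amp = 1.0 / math.sqrt(n_actions)
    return [amp] * n_actions


def grover_iteration(amps, target):
    """One textbook Grover iteration, simulated classically.

    Step 1 (oracle): flip the sign of the target action's amplitude.
    Step 2 (diffusion): reflect every amplitude about the mean.
    """
    flipped = [-a if i == target else a for i, a in enumerate(amps)]
    mean = sum(flipped) / len(flipped)
    return [2.0 * mean - a for a in flipped]


def action_probabilities(amps):
    """Measurement probabilities are the squared amplitudes."""
    return [a * a for a in amps]


# Assumed toy setting: 4 actions, reinforce action index 2 once.
policy = uniform_policy(4)
reinforced = grover_iteration(policy, target=2)
probs = action_probabilities(reinforced)
```

With four actions, a single iteration moves all probability onto the reinforced action; with more actions the increase per iteration is smaller, which is why the number of iterations acts as a tunable update strength in schemes of this kind.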
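Document information
Title: Quantum Multiple Q-Learning
Source journal: International Journal of Intelligence Science (English)  Subject: Medicine
Keywords: QUANTUM COMPUTING, REINFORCEMENT LEARNING, Q-LEARNING
Year, volume (issue): 2019, (1)
Page range: 1-22
Page count: 22  Classification number: R73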
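Journal impact
International Journal of Intelligence Science (English)
Quarterly
ISSN: 2163-0283
Address: Optics Valley Headquarters Space, No. 38 Tangxunhu North Road, Jiangxia District, Wuhan
Publications (articles): 102
Total downloads: 0
Total citations: 0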