Issues
- 0
- 0
- 3
有没有代码示例呢 ?
#37 opened by 2017040264 - 1
- 0
3.4.1节动作价值函数
#52 opened by TronYY - 3
p101 题3答案B文字错误
#46 opened by itlogger - 0
REINFORCE with Baseline 中的 slides 出现错误
#51 opened by jzhangCSER01 - 3
请问如何cite这本书呢?
#39 opened by yyanhan - 0
关于s,S与a,A间的相互转化
#50 opened by Clayfigure - 0
强化学习视频中使用的讲义和该 repo 中的 slides 对不上
#49 opened by Whisht - 3
跪求老师更新一节PPO的讲解视频
#27 opened by yanhongjin228228 - 2
7.3.2证明中的typo
#48 opened by yuechuhaoxi020609 - 0
10.3.3 小节漏字
#47 opened by aishangcengloua - 1
7.3.2 节可能的错误
#45 opened by aishangcengloua - 3
第五章SARSA算法描述是否有误
#44 opened by txsniper - 4
我不清楚这里是否写错了
#41 opened by Oliver-F1 - 1
劝你识相点,给我入驻B站(手动狗头)
#43 opened by Oliver-F1 - 0
github上的DRL.pdf是最新版本吗?
#42 opened by ShuhuaGao - 0
- 1
建议增加PPO和SAC讲解
#38 opened by TimHo0331 - 2
4.2.1 一术语使用不妥
#19 opened by AtomicVar - 1
- 1
- 2
感谢王先生难能可贵的分享,能否给书籍增加书签目录?
#36 opened by rocke2020 - 0
第9章笔误及第6章疑问
#29 opened by DeepGeGe - 0
Nothing
#35 opened by 2017040264 - 2
6.2.4 使用目标网络: 可能的错误
#34 opened by 2017040264 - 2
可能的错误:6.2.1小节--自举导致偏差的传播
#32 opened by 2017040264 - 1
8.1节可能的小错误
#33 opened by KID0031 - 1
- 6
对前两章基础部分内容的读后反馈
#15 opened by hydt - 0
- 0
前9章读后感
#28 opened by LeeChunley - 0
Double DQN gamma 参数
#25 opened by Code-Notebook - 0
3.5 添加相关概念
#23 opened by Code-Notebook - 1
4.4 Q 学习算法 P47 落下一个字
#24 opened by Code-Notebook - 2
很不错的书,希望增加目录,还有文中公式,引用的超链接
#12 opened by kli-casia - 0
基于强化学习的知识图谱推理
#21 opened by Joyrocky - 0
建议增加值分布强化学习的内容
#20 opened by lsyysl9711 - 2
阅读反馈
#18 opened by musicaudience - 1
ImageNet 在深度学习中的应用
#17 opened by Benjizhang - 1
- 2
确定策略梯度章节的改进建议
#14 opened by kli-casia - 6
TRPO中的一个小问题
#13 opened by kli-casia - 3
4.3.1算法推导的第一个公式
#8 opened by wangchuan - 1
- 1
Missing right parenthesis in Appendix A
#10 opened by Renovamen - 2
- 5
Question About P48
#7 opened by xiaobanni - 3
第四页有一处错字
#6 opened by hydelovegood