Issues
- 1
Questions about training A3C
#67 opened by Tord-Zhang - 1
About A3C
#66 opened by Tord-Zhang - 2
actor-critic based
#65 opened by Tord-Zhang - 4
How to process with the salient map?
#64 opened by happywu - 8
Implement optimality tightening
#60 opened by Kaixhin - 1
Partition number and segments
#63 opened by ColdCodeCool - 14
Implement asynchronous methods
#5 opened by lake4790k - 0
- 0
Implement Pop-Art
#6 opened by Kaixhin - 7
What is the actual performance?
#61 opened by cgel - 2
Why is the current sharedRmsprop thread safe?
#59 opened by pengsun - 1
gnuplots memory unreleased
#58 opened by nadavbh12 - 2
Finish prioritised experience replay
#42 opened by Kaixhin - 1
Possible improvements on speeding up
#54 opened by YurongYou - 2
Disagreements with the async paper
#53 opened by YurongYou - 1
problem in Agent.lua
#55 opened by Hislocked - 2
Load models like environments
#51 opened by mryellow - 4
Async A3C Network Outputs NaN
#50 opened by lordzapharos - 2
- 31
Recurrent Dqn
#8 opened by lake4790k - 1
OpenAI integration
#13 opened by lake4790k - 0
Allow non-visual environments
#44 opened by Kaixhin - 5
Exploration with pseudo counts
#34 opened by lake4790k - 11
Decouple Catch vs. Atari from code
#26 opened by Kaixhin - 34
- 0
Implement Memory Q-networks
#36 opened by Kaixhin - 0
Implement Retrace(λ)
#37 opened by Kaixhin - 1
Refactoring master before async merge
#23 opened by lake4790k - 0
correct SarsaAgent
#28 opened by lake4790k - 0
Unify ER and Async validation logic
#31 opened by lake4790k - 0
Add LSTM support to all async modes
#32 opened by lake4790k - 8
Hierarchical Dqn
#9 opened by lake4790k - 9
Multi gpu support
#10 opened by lake4790k - 0
Agent.valMemory and validate()
#16 opened by lake4790k - 12
stateBuffer issue with Catch on CPU
#11 opened by lake4790k - 5
Fix bootstrapped DQN
#7 opened by Kaixhin - 0
Test different implementations
#4 opened by Kaixhin - 3
bootstraps version when evaluating
#3 opened by jingweiz - 7
Hi,Kaixhin.Ask you for a question
#2 opened by Alex-zhai