nerdylinius

Pinned Repositories

async-rl
An attempt to reproduce the results of "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
Language:Python0 2 00
Asynchronous-Methods-for-Deep-Reinforcement-Learning
Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain in here: http://arxiv.org/pdf/1602.01783v1.pdf I used the one-step-Q-learning pseudocode, and now we can train the Pong game in less than 20 hours and without any GPU or network distribution.
Language:Python0 2 00
demo
Language:HTML0 3 00
ray
A system for parallel and distributed Python that unifies the ML ecosystem.
Language:Python0 2 00
tensorflow
Computation using data flow graphs for scalable machine learning
Language:C++0 2 00

nerdylinius's Repositories

nerdylinius/async-rl
An attempt to reproduce the results of "Asynchronous Methods for Deep Reinforcement Learning" (http://arxiv.org/abs/1602.01783)
Language:Python0 2 00
nerdylinius/Asynchronous-Methods-for-Deep-Reinforcement-Learning
Using a paper from Google DeepMind I've developed a new version of the DQN using threads exploration instead of memory replay as explain in here: http://arxiv.org/pdf/1602.01783v1.pdf I used the one-step-Q-learning pseudocode, and now we can train the Pong game in less than 20 hours and without any GPU or network distribution.
Language:Python0 2 00
nerdylinius/demo
Language:HTML0 3 00
nerdylinius/ray
A system for parallel and distributed Python that unifies the ML ecosystem.
Language:Python0 2 00
nerdylinius/tensorflow
Computation using data flow graphs for scalable machine learning
Language:C++0 2 00