Pinned Repositories
A3C-PyTorch
PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch
Algorithmic-Trading-Tools
Algorithmic Trading Tools
APEX
Autonomous exploration, active learning and human guidance with open-source Poppy humanoid robot platform and Explauto library
async_deep_reinforce
Asynchronous Methods for Deep Reinforcement Learning
atari-rl
Atari - Deep Reinforcement Learning algorithms in TensorFlow
awesome-knowledge-distillation
Awesome Knowledge Distillation
awesome-self-supervised-learning
A curated list of awesome Self-Supervised methods
BaRC
Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoorva Sharma, Mo Chen, Marco Pavone.
baselines-rudder
RUDDER for ATARI games with delayed rewards in OpenAI Baselines package
Bayesian-Neural-Networks
Pytorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace and more
ahavenoname's Repositories
ahavenoname/awesome-self-supervised-learning
A curated list of awesome Self-Supervised methods
ahavenoname/Bayesian-Neural-Networks
Pytorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace and more
ahavenoname/count_based_exploration_sr
ahavenoname/Deep-Reinforcement-Learning-Hands-On
Hands-on Deep Reinforcement Learning, published by Packt
ahavenoname/deep_abstract_q_network
ahavenoname/DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,近30万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
ahavenoname/DEHRL
Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019. Oral Presentation.
ahavenoname/DHP
Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach. TPAMI 2018.
ahavenoname/DIAYN
ahavenoname/episodic-curiosity
Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability
ahavenoname/exp4nav
(ICLR 2019) Learning Exploration Policies for Navigation
ahavenoname/go-explore
Code for Go-Explore: a New Approach for Hard-Exploration Problems
ahavenoname/google-research
Google AI Research
ahavenoname/ICLR2019-RL-Papers
The Reinforcement-Learning-Related Papers of ICLR 2019
ahavenoname/infinite-horizon-off-policy-estimation
ahavenoname/insightface
Face Analysis Project on MXNet
ahavenoname/Model-Free-Episodic-Control-1
Model-Free-Episodic-Control implementation based on DeepMinds paper: http://arxiv.org/abs/1606.04460
ahavenoname/models
Models and examples built with TensorFlow
ahavenoname/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
ahavenoname/multiworld
Multitask Environments for RL
ahavenoname/pytorch-a2c-ppo-acktr
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
ahavenoname/q-learning-delusion
A counterexample for Q-Learning, discussed in "Non-delusional Q-learning and value-iteration."
ahavenoname/rlkit
Collection of reinforcement learning algorithms
ahavenoname/SimionZoo
A workbench for online model-free Reinforcement Learning on continuous control problems
ahavenoname/SPTM
[ICLR 2018] Tensorflow/Keras code for Semi-parametric Topological Memory for Navigation
ahavenoname/synthetic-computer-vision
A list of synthetic dataset and tools for computer vision
ahavenoname/UCRL
ahavenoname/visual_foresight
Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control
ahavenoname/worldmodels.github.io
World Models
ahavenoname/WorldModelsExperiments
World Models Experiments