ahavenoname

Pinned Repositories

A3C-PyTorch
PyTorch implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm
Language: Python
Algorithmic-Trading-Tools
Algorithmic Trading Tools
Language: Python
APEX
Autonomous exploration, active learning and human guidance with open-source Poppy humanoid robot platform and Explauto library
Language: Python
async_deep_reinforce
Asynchronous Methods for Deep Reinforcement Learning
Language: Python
atari-rl
Atari - Deep Reinforcement Learning algorithms in TensorFlow
Language: Python
awesome-knowledge-distillation
Awesome Knowledge Distillation
awesome-self-supervised-learning
A curated list of awesome Self-Supervised methods
BaRC
Contains the code for "BaRC: Backward Reachability Curriculum for Robotic Reinforcement Learning" by Boris Ivanovic, James Harrison, Apoorva Sharma, Mo Chen, Marco Pavone.
Language: Python
baselines-rudder
RUDDER for ATARI games with delayed rewards in OpenAI Baselines package
Language: Python
Bayesian-Neural-Networks
PyTorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace and more
Language: Jupyter Notebook

ahavenoname's Repositories

ahavenoname/awesome-self-supervised-learning
A curated list of awesome Self-Supervised methods
ahavenoname/Bayesian-Neural-Networks
PyTorch implementations of Bayes By Backprop, MC Dropout, SGLD, the Local Reparametrization Trick, KF-Laplace and more
Language: Jupyter Notebook
ahavenoname/count_based_exploration_sr
Language: Python
ahavenoname/Deep-Reinforcement-Learning-Hands-On
Hands-on Deep Reinforcement Learning, published by Packt
Language: Python
ahavenoname/deep_abstract_q_network
Language: Python
ahavenoname/DeepLearning-500-questions
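Deep Learning 500 Questions: a question-and-answer treatment of frequently raised topics in probability, linear algebra, machine learning, deep learning, computer vision and related areas, written to help the author and any readers who need it. The book has 18 chapters and nearly 300,000 characters. Given the author's limited expertise, readers are kindly asked to point out any errors. To be continued. For collaboration, contact scutjy2015@163.com. All rights reserved; infringement will be pursued. Tan 2018.06
Language: TeX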
ahavenoname/DEHRL
Diversity-Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019. Oral Presentation.
Language: Python
ahavenoname/DHP
Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach. TPAMI 2018.
Language: Python
ahavenoname/DIAYN
Language: Python
ahavenoname/episodic-curiosity
TensorFlow/Keras code and trained models for Episodic Curiosity Through Reachability
Language: Jupyter Notebook
ahavenoname/exp4nav
(ICLR 2019) Learning Exploration Policies for Navigation
Language: Python
ahavenoname/go-explore
Code for Go-Explore: a New Approach for Hard-Exploration Problems
Language: Python
ahavenoname/google-research
Google AI Research
Language: Jupyter Notebook
ahavenoname/ICLR2019-RL-Papers
The Reinforcement-Learning-Related Papers of ICLR 2019
ahavenoname/infinite-horizon-off-policy-estimation
Language: Python
ahavenoname/insightface
Face Analysis Project on MXNet
Language: Python
ahavenoname/Model-Free-Episodic-Control-1
Model-Free-Episodic-Control implementation based on DeepMind's paper: http://arxiv.org/abs/1606.04460
Language: Python
ahavenoname/models
Models and examples built with TensorFlow
ahavenoname/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language: Python
ahavenoname/multiworld
Multitask Environments for RL
Language: Python
ahavenoname/pytorch-a2c-ppo-acktr
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
Language: Python
ahavenoname/q-learning-delusion
A counterexample for Q-Learning, discussed in "Non-delusional Q-learning and value-iteration."
Language: Jupyter Notebook
ahavenoname/rlkit
Collection of reinforcement learning algorithms
Language: Python
ahavenoname/SimionZoo
A workbench for online model-free Reinforcement Learning on continuous control problems
Language: C++
ahavenoname/SPTM
[ICLR 2018] TensorFlow/Keras code for Semi-parametric Topological Memory for Navigation
Language: Python
ahavenoname/synthetic-computer-vision
A list of synthetic datasets and tools for computer vision
Language: Python
ahavenoname/UCRL
Language: Python
ahavenoname/visual_foresight
Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control
Language: Python
ahavenoname/worldmodels.github.io
World Models
Language: HTML
ahavenoname/WorldModelsExperiments
World Models Experiments
Language: Jupyter Notebook