Pinned Repositories
afit-swarm-simulation
alf
Agent Learning Framework
alpha_dogfight
AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Awesome-Meta-Learning
A curated list of Meta Learning papers, code, books, blogs, videos, datasets and other resources.
Complex-Network
A collection of complex network research resources and fundamentals for study
DeepRL-1
[Deep Reinforcement Learning Community] A service platform with the most comprehensive materials and learning content
dogfighter
pymarl
Beta code release for Python Multi-Agent Reinforcement Learning framework
sacred
Sacred is a tool, developed at IDSIA, to help you configure, organize, log and reproduce experiments.
LiuQiangOpenMind's Repositories
LiuQiangOpenMind/alpha_dogfight
LiuQiangOpenMind/Complex-Network
A collection of complex network research resources and fundamentals for study
LiuQiangOpenMind/dogfighter
LiuQiangOpenMind/sacred
Sacred is a tool, developed at IDSIA, to help you configure, organize, log and reproduce experiments.
LiuQiangOpenMind/alf
Agent Learning Framework
LiuQiangOpenMind/Awesome-Meta-Learning
A curated list of Meta Learning papers, code, books, blogs, videos, datasets and other resources.
LiuQiangOpenMind/deep-Q-networks
Implementations of algorithms from the Q-learning family. Implementations include: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51
LiuQiangOpenMind/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
LiuQiangOpenMind/Deep-Reinforcement-Learning-Hands-On
Hands-on Deep Reinforcement Learning, published by Packt
LiuQiangOpenMind/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ...
LiuQiangOpenMind/FINDER
FINDER - FInding key players in complex Networks through DEep Reinforcement learning (Nature Machine Intelligence)
LiuQiangOpenMind/HDDPG-HER-RND
Hierarchical DDPG + Hindsight Experience Replay + Random Network Distillation
LiuQiangOpenMind/HER
PyTorch implementation of Hindsight Experience Replay
LiuQiangOpenMind/hyperopt
Distributed Asynchronous Hyperparameter Optimization in Python
LiuQiangOpenMind/imitation_learning
PyTorch implementation of some reinforcement learning algorithms: Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), V-MPO, Behavior Cloning (BC). More algorithms will be added.
LiuQiangOpenMind/leeml-notes
Notes on Hung-yi Lee's (李宏毅) "Machine Learning" course; read online at https://datawhalechina.github.io/leeml-notes
LiuQiangOpenMind/MA-AIRL
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
LiuQiangOpenMind/MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning", ICML 2019
LiuQiangOpenMind/minimalRL
Implementations of basic RL algorithms with minimal lines of code! (PyTorch-based)
LiuQiangOpenMind/Own-Reinforcement-Learning
LiuQiangOpenMind/Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
LiuQiangOpenMind/pytorch-maml-rl
Reinforcement Learning with Model-Agnostic Meta-Learning in PyTorch
LiuQiangOpenMind/quad-swarm-rl
Additional environments compatible with OpenAI gym
LiuQiangOpenMind/rainbow-is-all-you-need
Rainbow is all you need! Step-by-step tutorials from DQN to Rainbow
LiuQiangOpenMind/ray
A fast and simple framework for building and running distributed applications.
LiuQiangOpenMind/RL-TF1
Reinforcement learning algorithms implemented in TensorFlow 1.x
LiuQiangOpenMind/rlkit
Collection of reinforcement learning algorithms
LiuQiangOpenMind/RLs
Reinforcement Learning Algorithms: SAC, TD3, TAC
LiuQiangOpenMind/SQDDPG
A framework for multi-agent reinforcement learning research, including the implementation of the experiments from the paper "Shapley Q-value: A Local Reward Approach to Solve Global Reward Games".
LiuQiangOpenMind/weightagnostic.github.io
nothing to see here yet