LiuQiangOpenMind

Nanjing

Pinned Repositories

afit-swarm-simulation
Language:C++00
alf
Agent Learning Framework
Language:Python00
alpha_dogfight
Language:Python3 2 21
AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Language:Python00
Awesome-Meta-Learning
A curated list of Meta Learning papers, code, books, blogs, videos, datasets and other resources.
00
Complex-Network
A collection of complex network research resources and foundational study material
Language:Jupyter Notebook10
DeepRL-1
[Deep Reinforcement Learning Community] A service platform with the most comprehensive resources and learning content
11
dogfighter
Language:Python10
pymarl
Beta code release for Python Multi-Agent Reinforcement Learning framework
Language:Python10
sacred
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
Language:Python10

LiuQiangOpenMind's Repositories

LiuQiangOpenMind/alpha_dogfight
Language:Python3 2 21
LiuQiangOpenMind/Complex-Network
A collection of complex network research resources and foundational study material
Language:Jupyter Notebook10
LiuQiangOpenMind/dogfighter
Language:Python10
LiuQiangOpenMind/sacred
Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.
Language:Python10
LiuQiangOpenMind/alf
Agent Learning Framework
Language:Python00
LiuQiangOpenMind/Awesome-Meta-Learning
A curated list of Meta Learning papers, code, books, blogs, videos, datasets and other resources.
00
LiuQiangOpenMind/deep-Q-networks
Implementations of algorithms from the Q-learning family. Implementations include: DQN, DDQN, Dueling DQN, PER+DQN, Noisy DQN, C51
Language:Jupyter Notebook
LiuQiangOpenMind/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
LiuQiangOpenMind/Deep-Reinforcement-Learning-Hands-On
Hands-on Deep Reinforcement Learning, published by Packt
Language:Python
LiuQiangOpenMind/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language:Python
LiuQiangOpenMind/FINDER
FINDER - FInding key players in complex Networks through DEep Reinforcement learning (Nature Machine Intelligence)
LiuQiangOpenMind/HDDPG-HER-RND
Hierarchical DDPG + Hindsight Experience Replay + Random Network Distillation
LiuQiangOpenMind/HER
PyTorch implementation of hindsight experience replay
LiuQiangOpenMind/hyperopt
Distributed Asynchronous Hyperparameter Optimization in Python
Language:Python
LiuQiangOpenMind/imitation_learning
PyTorch implementation of some reinforcement learning algorithms: Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), V-MPO, Behavior Cloning (BC). More algorithms will be added.
LiuQiangOpenMind/leeml-notes
Notes on Hung-yi Lee's "Machine Learning" course; read online at: https://datawhalechina.github.io/leeml-notes
LiuQiangOpenMind/MA-AIRL
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
Language:Python
LiuQiangOpenMind/MAAC
Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019
LiuQiangOpenMind/minimalRL
Implementations of basic RL algorithms with minimal lines of code! (PyTorch based)
Language:Python
LiuQiangOpenMind/Own-Reinforcement-Learning
Language:Python
LiuQiangOpenMind/Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
Language:Jupyter Notebook
LiuQiangOpenMind/pytorch-maml-rl
Reinforcement Learning with Model-Agnostic Meta-Learning in PyTorch
LiuQiangOpenMind/quad-swarm-rl
Additional environments compatible with OpenAI gym
LiuQiangOpenMind/rainbow-is-all-you-need
Rainbow is all you need! Step-by-step tutorials from DQN to Rainbow
Language:Jupyter Notebook
LiuQiangOpenMind/ray
A fast and simple framework for building and running distributed applications.
Language:Python
LiuQiangOpenMind/RL-TF1
Reinforcement learning algorithms implemented in TensorFlow 1.x
LiuQiangOpenMind/rlkit
Collection of reinforcement learning algorithms
LiuQiangOpenMind/RLs
Reinforcement Learning Algorithms: SAC, TD3, TAC
LiuQiangOpenMind/SQDDPG
A framework for research on multi-agent reinforcement learning, with the implementation of the experiments from the paper "Shapley Q-value: A Local Reward Approach to Solve Global Reward Games".
LiuQiangOpenMind/weightagnostic.github.io
nothing to see here yet
Language:JavaScript