Pinned Repositories
2s-AGCN
Two-Stream Adaptive Graph Convolutional Networks for Skeleton-Based Action Recognition in CVPR19
A2C
A Clearer and Simpler Synchronous Advantage Actor Critic (A2C) Implementation in TensorFlow
A3C
Deep reinforcement learning using an asynchronous advantage actor-critic (A3C) model.
A3C-LSTM-with-Tensorflow
An implementation of the A3C deep reinforcement learning method using a LSTM layer. Created with Tensorflow.
A3C_grid_world
Simple tensorflow implementation of Asynchronous Advantage Actor-Critic (A3C) for a 2-D grid environment
AirSim
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
DRL4Recsys
Courses on Deep Reinforcement Learning (DRL) and DRL papers for recommender systems
LDG
PyTorch code for "Learning Temporal Attention in Dynamic Graphs with Bilinear Interactions"
Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
wanghuimu's Repositories
wanghuimu/awesome-deep-learning-papers
The most cited deep learning papers
wanghuimu/awesome-deep-rl
This project is for learning and researching on Deep RL. Maintained by University AI researchers.
wanghuimu/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
wanghuimu/CopyTranslator
Foreign language reading and translation assistant based on copy and translate.
wanghuimu/CrowdNav
[ICRA19] Crowd-aware Robot Navigation with Attention-based Deep Reinforcement Learning
wanghuimu/distributedRL_MAPF
Distributed RL/IL code for Multi-Agent Path Finding (MAPF)
wanghuimu/dm_control
The DeepMind Control Suite and Package
wanghuimu/env-zoo
A curated list of reinforcement learning environments and frameworks.
wanghuimu/GitHubDaily
GitHubDaily 分享内容定期整理与分类。欢迎推荐、自荐项目,让更多人知道你的项目。
wanghuimu/irl
Code for "Incremental reinforcement learning"
wanghuimu/Keras-GAN
Keras implementations of Generative Adversarial Networks.
wanghuimu/learning_to_adapt
Learning to Adapt in Dynamic, Real-World Environment through Meta-Reinforcement Learning
wanghuimu/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
wanghuimu/MAgent
A Platform for Many-agent Reinforcement Learning
wanghuimu/magnet
MAGNet: Multi-agents control using Graph Neural Networks
wanghuimu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
wanghuimu/ml-agents
Unity Machine Learning Agents Toolkit
wanghuimu/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
wanghuimu/neural-mmo
Neural MMO - A Massively Multiagent Game Environment
wanghuimu/pymarl
Beta code release for Python Multi-Agent Reinforcement Learning framework
wanghuimu/pytorch-a2c-ppo-acktr
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO) and Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR).
wanghuimu/ray
A system for parallel and distributed Python that unifies the ML ecosystem.
wanghuimu/robosumo
Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments"
wanghuimu/self_drive
基于树莓派的自动驾驶小车,利用树莓派和tensorflow实现小车在赛道的自动驾驶。(Self-driving car based on raspberry pi(tensorflow))
wanghuimu/SenseAct
SenseAct: A computational framework for developing real-world robot learning tasks
wanghuimu/Stage
Mobile robot simulator
wanghuimu/tensorflow-on-arm
TensorFlow for Arm
wanghuimu/tensorflow_practice
tensorflow实战练习,包括强化学习、推荐系统、nlp等
wanghuimu/the-gan-zoo
A list of all named GANs!
wanghuimu/visdom
A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy.