Eyunfang's Stars
openai/multiagent-particle-envs
Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
ChuaCheowHuan/reinforcement_learning
My reproduction of various reinforcement learning algorithms (DQN variants, A3C, DPPO, RND with PPO) in TensorFlow.
JohannesAck/tf2multiagentrl
Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x
princewen/tensorflow_practice
Hands-on TensorFlow practice, including reinforcement learning, recommender systems, NLP, and more
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
iperov/DeepFaceLab
DeepFaceLab is the leading software for creating deepfakes.
thu-ml/tianshou
An elegant PyTorch deep reinforcement learning library.
eyounx/VirtualTaobao
Virtual-Taobao simulators with OpenAI Gym interface
uvipen/Super-mario-bros-A3C-pytorch
Asynchronous Advantage Actor-Critic (A3C) algorithm for Super Mario Bros
DaVilla7/Papers-PPT
A set of the papers I have read and the presentations I gave on them
epignatelli/human-level-control-through-deep-reinforcement-learning
A jax/stax implementation of: Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G. and Petersen, S., 2015. Human-level control through deep reinforcement learning. Nature, 518(7540), pp.529-533.
hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
tensorpack/tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
imarvinle/awesome-cs-books
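🔥 A comprehensive collection of classic programming books, covering: computer systems and networks, system architecture, algorithms and data structures, front-end development, back-end development, mobile development, databases, testing, projects and teams, programmer career development, job hunting and interviews, and more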
tensorflow/tensorflow
An Open Source Machine Learning Framework for Everyone
BlueFisher/Reinforcement-Learning
OneRaynyDay/RLEngine
A simple reinforcement learning simulation engine for OpenAI's gym.
rll/rllab
rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.
ikostrikov/pytorch-a3c
PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".
jingweiz/pytorch-rl
Deep Reinforcement Learning with PyTorch & Visdom
onlytailei/A3C-PyTorch
PyTorch implementation of the Asynchronous Advantage Actor-Critic (A3C) algorithm
jaromiru/AI-blog
Accompanying repository for Let's make a DQN / A3C series.
crimx/ext-saladict
🥗 All-in-one professional pop-up dictionary and page translator which supports multiple search modes, page translations, new word notebook and PDF selection searching.
alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
chen1415-UNSW/COMP9020
faruto/Libsvm-FarutoUltimate-Version
Libsvm-FarutoUltimate Version
zhoubolei/introRL
Intro to Reinforcement Learning (强化学习纲要, "Outline of Reinforcement Learning")
PaddlePaddle/PARL
A high-performance distributed training framework for Reinforcement Learning
ucla-rlcourse/RLexample
Some basic examples of playing with RL
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.