lxb678's Stars
jcwleo/curiosity-driven-exploration-pytorch
Curiosity-driven Exploration by Self-supervised Prediction
jcwleo/mario_rl
chagmgang/pytorch_ppo_rl
Pytorch implementation of intrinsic curiosity module with proximal policy optimization
deligentfool/policy_based_RL
The implement of the policy gradient RL algorithm with pytorch
adik993/ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
MdMuzahid/RL_ICM
Intrinsic Curiosity Module
rlcode/per
Prioritized Experience Replay (PER) implementation in PyTorch
AmazingAng/WTF-DeepRL
Deep RL algorithm in pytorch
summerANDcode/SAC_PER
XinJingHao/DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
deligentfool/NFSP_lasertag
The implement of Neural Fictitious Self Play with pytorch
Srivatsan-Srinivasan/poker-NFSP
NimaPublic/NFSP-CNN
cptanalatriste/nfsp-playground
Exploring Neural Fictitious Self-Play
deligentfool/leduc_nfsp
The implement of Neural Fictitious Self Play with pytorch
dantodor/Neural-Ficititious-Self-Play-in-Imperfect-Information-Games
This Project is based on Heinrich and Silvers Work "Neural Fictitious Self-Play in Imperfect Information Games". It includes the whole Game-Environment "Leduc Hold'em" which is inspired by the OpenAI Gym-Project. Furthermore it includes an NFSP Agent.
arixlin/RL_NFSP
thomasj02/nfsp-pytorch
Neural Fictitious Self-Play in Pytorch
liuxinyuanxy/NFSPwithHuman
人类-智能体协同德州扑克平台及AI开发,拓展自http://turingai.ia.ac.cn/
younggyoseo/pytorch-nfsp
Implementation of Deep Reinforcement Learning from Self-Play in Imperfect-Information Games (Heinrich and Silver, 2016)
PhDChe/Poker-1
Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping
Lizhi-sjtu/DRL-code-pytorch
Concise pytorch implements of DRL algorithms, including REINFORCE, A2C, DQN, PPO(discrete and continuous), DDPG, TD3, SAC.
BY571/SAC_discrete
PyTorch implementation of the discrete Soft-Actor-Critic algorithm.
Meta-YZ/SAC-Discrete
离散动作的SAC
XinJingHao/SAC-Discrete-Pytorch
A clean and robust Pytorch implementation of SAC on discrete action space
p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
toshikwa/sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
yhl2333/p_guide-dr_missile
gradesign
yhl2333/gradu_design
dqn-aircombat
PKU-MARL/HARL
Official implementation of HARL algorithms based on PyTorch.