hilberthu's Stars
changkun/modern-cpp-tutorial
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
higgsfield/RL-Adventure
Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL
werner-duvaud/muzero-general
MuZero
P3TERX/GeoLite.mmdb
MaxMind's GeoIP2 GeoLite2 Country, City, and ASN databases
oschwald/geoip2-golang
Unofficial MaxMind GeoIP2 Reader for Go
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
rail-berkeley/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
PacktPublishing/Deep-Reinforcement-Learning-Hands-On-Second-Edition
Deep-Reinforcement-Learning-Hands-On-Second-Edition, published by Packt
openai/random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
Neveryu/vue-cms
基于 Vue 和 ElementUI 构建的一个企业级后台管理系统
koulanurag/muzero-pytorch
Pytorch Implementation of MuZero
tarequeh/DES
Implementation of Data Encryption Standard (DES) in C
kwotsin/TensorFlow-Xception
TensorFlow implementation of the Xception Model by François Chollet
Unity-Technologies/com.unity.services.samples
A collection of working examples of Unity Gaming Services.
vwxyzjn/invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
liuanji/WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
reionwong/QNavigationWidget
this is a simple navigation widget for qt.
xiyoo0812/quanta
A Game Server Engine based on Lua!
xlnwel/model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X and etc. in TF1.x
Akagi201/hmac-sha1
Standalone implementation of `HMAC()` + `EVP_sha1()` in `OpenSSL`
lyokato/cpp-cryptlite
C++ library that supports base64-encoding, sha1/sha256 hashing, and hmac calculation
MoMe36/BranchingDQN
BranchingDQN
mpeterv/sha1
Implementation of SHA-1 and HMAC-SHA-1 in pure Lua.
crafts-dev/ebooks-2
ebooks
YangShengqi/cartpole_ppo_lstm
srinivr/rl
Implementation of DQN, n-step DQN and TreeQN
ZhouMM-jervis/Multi-step-DQN