hilberthu's Stars

changkun/modern-cpp-tutorial
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/
Language: C++ | 23.9k 621 1313k
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language: Python | 8.8k 63 1.5k1.7k
sweetice/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Language: Python | 3.9k 36 34842
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language: Python | 3.6k 67 229829
higgsfield/RL-Adventure
PyTorch implementation of DQN / DDQN / prioritized replay / noisy networks / distributional values / Rainbow / hierarchical RL
Language: Jupyter Notebook | 3k 72 22589
werner-duvaud/muzero-general
MuZero
Language: Python | 2.5k 74 175607
P3TERX/GeoLite.mmdb
MaxMind's GeoIP2 GeoLite2 Country, City, and ASN databases
2.1k 60 14283
oschwald/geoip2-golang
Unofficial MaxMind GeoIP2 Reader for Go
Language: Go | 1.9k 33 58193
nikhilbarhate99/PPO-PyTorch
Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch
Language: Python | 1.6k 9 61342
rail-berkeley/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Language: Python | 1.2k 37 103238
PacktPublishing/Deep-Reinforcement-Learning-Hands-On-Second-Edition
Deep-Reinforcement-Learning-Hands-On-Second-Edition, published by Packt
Language: Jupyter Notebook | 1.1k 25 44534
openai/random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
Language: Python | 872 26 20160
Neveryu/vue-cms
An enterprise-grade back-office management system built with Vue and ElementUI
Language: TypeScript | 628 22 10230
koulanurag/muzero-pytorch
PyTorch implementation of MuZero
Language: Python | 335 21 656
tarequeh/DES
Implementation of Data Encryption Standard (DES) in C
Language: C | 258 15 8140
kwotsin/TensorFlow-Xception
TensorFlow implementation of the Xception Model by François Chollet
Language: Python | 206 15 1190
Unity-Technologies/com.unity.services.samples
A collection of working examples of Unity Gaming Services.
141 19 056
vwxyzjn/invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
Language: Python | 134 2 322
liuanji/WU-UCT
A novel parallel UCT algorithm with linear speedup and negligible performance loss.
Language: Python | 105 4 824
reionwong/QNavigationWidget
A simple navigation widget for Qt.
Language: C++ | 76 8 039
xiyoo0812/quanta
A Game Server Engine based on Lua!
Language: C | 67 4 623
xlnwel/model-free-algorithms
TD3, SAC, IQN, Rainbow, PPO, Ape-X, etc. in TF1.x
Language: Python | 60 4 210
Akagi201/hmac-sha1
Standalone implementation of `HMAC()` + `EVP_sha1()` in `OpenSSL`
Language: C | 53 4 128
lyokato/cpp-cryptlite
C++ library that supports base64-encoding, sha1/sha256 hashing, and hmac calculation
Language: C++ | 48 4 214
MoMe36/BranchingDQN
BranchingDQN
Language: Python | 48 3 35
mpeterv/sha1
Implementation of SHA-1 and HMAC-SHA-1 in pure Lua.
Language: Lua | 32 3 310
crafts-dev/ebooks-2
ebooks
21 2 02
YangShengqi/cartpole_ppo_lstm
Language: Python | 13 2 02
srinivr/rl
Implementation of DQN, n-step DQN and TreeQN
Language: Python | 3 1 01
ZhouMM-jervis/Multi-step-DQN
Language: Python | 22