Pinned Repositories
alphadev
AutoOpt_RL
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Continues_ppo
decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
DI-engine
OpenDILab Decision AI Engine
DistributedEvolutionaryComputation
A (still growing) paper list of Evolutionary Computation (EC) published in some (rather all) top-tier (and also EC-focused) journals and conferences. For EC-focused publications, only Parallel/Distributed EC are covered.
DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
flash-linux0.11-talk
你管这破玩意叫操作系统源码 — 像小说一样品读 Linux 0.11 核心代码
OpenAI-baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Hust1Booze's Repositories
Hust1Booze/Continues_ppo
Hust1Booze/alphadev
Hust1Booze/AutoOpt_RL
Hust1Booze/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Hust1Booze/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Hust1Booze/DI-engine
OpenDILab Decision AI Engine
Hust1Booze/DistributedEvolutionaryComputation
A (still growing) paper list of Evolutionary Computation (EC) published in some (rather all) top-tier (and also EC-focused) journals and conferences. For EC-focused publications, only Parallel/Distributed EC are covered.
Hust1Booze/DTQN
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
Hust1Booze/flash-linux0.11-talk
你管这破玩意叫操作系统源码 — 像小说一样品读 Linux 0.11 核心代码
Hust1Booze/OpenAI-baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Hust1Booze/pypop
PyPop7: A Pure-Python Library for POPulation-based Black-Box Optimization (BBO), especially their Large-Scale versions/variants.
Hust1Booze/q-transformer
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google Deepmind
Hust1Booze/retro_branching_offline
Learning to branch with reinforcement learning using retrospective trajectories for exact combinatorial optimisation.
Hust1Booze/spinningup
An educational resource to help anyone learn deep reinforcement learning.
Hust1Booze/transformer-rl-4
Hust1Booze/wmg_agent
WMG agent