Pinned Repositories
trl
Train transformer language models with reinforcement learning.
cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
gym-microrts-paper
The source code for the gym-microrts paper.
invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
lm-human-preference-details
RLHF implementation details of OAI's 2019 codebase
portwarden
Create Encrypted Backups of Your Bitwarden Vault with Attachments
PPO-Implementation-Deep-Dive
DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details
ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
summarize_from_feedback_details
vwxyzjn's Repositories
vwxyzjn/gym-microrts-paper
The source code for the gym-microrts paper.
vwxyzjn/gym-pysc2
Gym wrapper for pysc2
vwxyzjn/envpool-cleanrl
vwxyzjn/cleangpt
vwxyzjn/entity-ppo-demo
vwxyzjn/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
vwxyzjn/envpool-xla-cleanrl
vwxyzjn/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
vwxyzjn/awesome-reinforcement-learning-lib
GitHub's code repository is all you need
vwxyzjn/dm-haiku
JAX-based neural network library
vwxyzjn/dragonfly
A modern replacement for Redis and Memcached
vwxyzjn/enn-trainer
vwxyzjn/enn-zoo
Collection of entity-gym bindings for different reinforcement learning environments.
vwxyzjn/entity-gym
Standard interface for entity based reinforcement learning environments.
vwxyzjn/envpool
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
vwxyzjn/flax
Flax is a neural network library for JAX that is designed for flexibility.
vwxyzjn/Gymnasium
A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)
vwxyzjn/hyperstate
Opinionated library for managing hyperparameters and mutable state of machine learning training systems.
vwxyzjn/IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
vwxyzjn/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
vwxyzjn/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
vwxyzjn/moolib
A library for distributed ML training with PyTorch
vwxyzjn/moolib-data
vwxyzjn/poetry12bug
vwxyzjn/rl-experiments
Keeping track of RL experiments
vwxyzjn/rl_games
RL implementations
vwxyzjn/rogue-net
Entity Gym compatible ragged batch transformer implementation.
vwxyzjn/Shimmy
An API conversion tool for popular external reinforcement learning environments
vwxyzjn/v23
Volume 23 of JMLR
vwxyzjn/vwxyzjn.github.io