vwxyzjn
Computer Science Ph.D student at Drexel University researching Game Artificial Intelligence
Drexel UniversityPhiladelphia, PA
Pinned Repositories
MicroRTS-Py
A simple and highly efficient RTS-game-inspired environment for reinforcement learning (formerly Gym-MicroRTS)
a2c_is_a_special_case_of_ppo
A2C is a special case of PPO!
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
gym-microrts-paper
The source code for the gym-microrts paper.
invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
jupyter_disqus
Add Disqus to your Jupyter notebook.
portwarden
Create Encrypted Backups of Your Bitwarden Vault with Attachments
PPO-Implementation-Deep-Dive
DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details
ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
SC2AI
Integrated Tensorforce and OpenAI Gym to train SC II game agents.
vwxyzjn's Repositories
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
vwxyzjn/portwarden
Create Encrypted Backups of Your Bitwarden Vault with Attachments
vwxyzjn/ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
vwxyzjn/invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
vwxyzjn/gym-microrts-paper
The source code for the gym-microrts paper.
vwxyzjn/gym-pysc2
Gym wrapper for pysc2
vwxyzjn/envpool-cleanrl
vwxyzjn/ppo-atari-metrics
vwxyzjn/microrts
vwxyzjn/entity-ppo-demo
vwxyzjn/envpool-xla-cleanrl
vwxyzjn/awesome-reinforcement-learning-lib
GitHub's code repository is all you need
vwxyzjn/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
vwxyzjn/dragonfly
A modern replacement for Redis and Memcached
vwxyzjn/enn-trainer
vwxyzjn/entity-gym
Standard interface for entity based reinforcement learning environments.
vwxyzjn/envpool
C++-based high-performance parallel environment execution engine for general RL environments.
vwxyzjn/flax
Flax is a neural network library for JAX that is designed for flexibility.
vwxyzjn/Gymnasium
A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)
vwxyzjn/hyperstate
Opinionated library for managing hyperparameters and mutable state of machine learning training systems.
vwxyzjn/IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
vwxyzjn/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
vwxyzjn/poetry12bug
vwxyzjn/rl-experiments
Keeping track of RL experiments
vwxyzjn/rl_games
RL implementations
vwxyzjn/rogue-net
Entity Gym compatible ragged batch transformer implementation.
vwxyzjn/Shimmy
An API conversion tool for popular external reinforcement learning environments
vwxyzjn/torchbeast
A PyTorch Platform for Distributed RL
vwxyzjn/v23
Volume 23 of JMLR
vwxyzjn/vwxyzjn.github.io