Pinned Repositories
trl
Train transformer language models with reinforcement learning.
cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
gym-microrts-paper
The source code for the gym-microrts paper.
invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
lm-human-preference-details
RLHF implementation details of OAI's 2019 codebase
portwarden
Create Encrypted Backups of Your Bitwarden Vault with Attachments
PPO-Implementation-Deep-Dive
DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details
ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
summarize_from_feedback_details
vwxyzjn's Repositories
vwxyzjn/jupyter_disqus
Add Disqus to your Jupyter notebook.
vwxyzjn/LP_optimization_python
Linear Programming for Optimal Scheduling by Using Gurobipy
vwxyzjn/Sentiment-Analysis-LSTM
Used neural network to classify movie reviews based on sentiment
vwxyzjn/vuetify-parallax-starter2
vwxyzjn/admin
vwxyzjn/algoliasearch-client-go
:mag: Algolia Search API Client for Go
vwxyzjn/assignment1-demo
vwxyzjn/cocalc_docker_python3
vwxyzjn/constitution
中华人民共和国宪法
vwxyzjn/cover_letters
vwxyzjn/docs
Documentation for Vuetify.js
vwxyzjn/fuckzhxhs
本网页因被火爆分享被微信判定为恶意诱导分享,请于浏览器中打开。
vwxyzjn/fucommencement
vwxyzjn/hello_cargo
vwxyzjn/histraffic
vwxyzjn/info_collection
vwxyzjn/parallax-starter
Vuetify parallax starter theme
vwxyzjn/penspider
A web crawler that crawls the fountain pens listing
vwxyzjn/Plane-Shooting-Problem-Dynamic-Programming
Dynamic Programming
vwxyzjn/sc2aibot
Implementing reinforcement-learning algorithms for pysc2 -environment
vwxyzjn/sc2gym
PySC2 OpenAI Gym Environments
vwxyzjn/Summer-Research
vwxyzjn/tccv-1
Two columns curriculum vitae
vwxyzjn/tensorforce
TensorForce: A TensorFlow library for applied reinforcement learning
vwxyzjn/Text-to-Image
Convert your text to a grayscale image and back.
vwxyzjn/TextBlob
Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
vwxyzjn/vue
A progressive, incrementally-adoptable JavaScript framework for building UI on the web.
vwxyzjn/vuetify-landing-starter
vwxyzjn/vuetify-tab-router-demo
vwxyzjn/vuetify-upload-demo