Pinned Repositories
trl
Train transformer language models with reinforcement learning.
cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
gym-microrts-paper
The source code for the gym-microrts paper.
invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
lm-human-preference-details
RLHF implementation details of OAI's 2019 codebase
portwarden
Create Encrypted Backups of Your Bitwarden Vault with Attachments
PPO-Implementation-Deep-Dive
DEPRECATED - please visit https://github.com/vwxyzjn/ppo-implementation-details
ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
summarize_from_feedback_details
vwxyzjn's Repositories
vwxyzjn/SC2AI
Integrated Tensorforce and OpenAI Gym to train SC II game agents.
vwxyzjn/CS583
vwxyzjn/CS583FinalProject
vwxyzjn/CS618
vwxyzjn/ivideo
A client (Mac, Windows, Linux) for watching all videos from China's mainstream video platforms, including VIP content
vwxyzjn/Resume-master
vwxyzjn/RLControlSkipFrames
vwxyzjn/admin
vwxyzjn/bouncer
Validation for go http handlers
vwxyzjn/chat
Instant messaging server; backend in Go; iOS, Android, web, command line clients; chatbots
vwxyzjn/cocalc_docker_python3
vwxyzjn/constitution
Constitution of the People's Republic of China
vwxyzjn/docker-jitsi-meet
Jitsi Meet on Docker
vwxyzjn/elivel
vwxyzjn/fucommencement
vwxyzjn/fucommencement-backend
vwxyzjn/mytrojan
vwxyzjn/OpenAPI-Specification
The OpenAPI Specification Repository
vwxyzjn/otpify
Use otpify to host your own OTP client
vwxyzjn/pgmpy
Python Library for Inference (Causal and Probabilistic) and learning in Bayesian Networks
vwxyzjn/portwarden-frontend
The front end of portwarden
vwxyzjn/ptan
PyTorch Agent Net: reinforcement learning toolkit for pytorch
vwxyzjn/ReDoc
📘 OpenAPI/Swagger-generated API Reference Documentation
vwxyzjn/RemixIcon
Open source neutral style icon system
vwxyzjn/StatsHW
vwxyzjn/tccv-bullet-points
vwxyzjn/tensorflow-beginner
vwxyzjn/tensorflowbyexample
Make tensorflow more practical and less magical
vwxyzjn/whelpsite
vwxyzjn/yakuake-session
A script to create new yakuake sessions from the command line or '.desktop' files. It allows yakuake to be a better replacement for konsole.