typoverflow
Second-year M.Sc student @ School of AI, Nanjing University, focusing on reinforcement learning.
Nanjing University, Nanjing
Pinned Repositories
ACT
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
OfflineRL-Lib
Benchmarked implementations of Offline RL Algorithms.
.dotfiles
A repo containing bash scripts to deploy a reinforcement learning dev environment with one click!
chainy-zsh-theme
Chainy Theme for Oh My ZSH
note
notes for NJU courses
Pirror
A magic mirror (Mirror) based on Raspberry Pi (Pi) and PyGame
pytorch-crf
A PyTorch implementation of conditional random fields (CRF)
UtilsRL
A Python module designed for agile RL algorithm development.
WiseRL
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
unstable_baselines
Re-implementations of SOTA RL algorithms.
typoverflow's Repositories
typoverflow/UtilsRL
A Python module designed for agile RL algorithm development.
typoverflow/note
notes for NJU courses
typoverflow/Pirror
A magic mirror (Mirror) based on Raspberry Pi (Pi) and PyGame
typoverflow/WiseRL
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
typoverflow/.dotfiles
A repo containing bash scripts to deploy a reinforcement learning dev environment with one click!
typoverflow/chainy-zsh-theme
Chainy Theme for Oh My ZSH
typoverflow/Decision-RWKV
Preliminary attempt to use RWKV to achieve infinite context length for decision-making.
typoverflow/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
typoverflow/sbx
SBX: Stable Baselines Jax (SB3 + Jax)
typoverflow/sync-issue
Sync issues between nju.git and GitHub repositories using GitHub Actions.
typoverflow/.tmux
🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️
typoverflow/alphastar
typoverflow/ChatGPT-Next-Web
One-Click to deploy well-designed ChatGPT web UI on Vercel. Get your own ChatGPT web service with one click.
typoverflow/CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
typoverflow/CODAS
The Official Code for Cross-Modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning
typoverflow/CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC
typoverflow/hok_env
typoverflow/meta_rl
Meta RL codebase for Unstable Baselines
typoverflow/mirai-bot-manager
typoverflow/MOSS-RLHF
MOSS-RLHF
typoverflow/OfflineRL
A collection of offline reinforcement learning algorithms. This is a mirror repo from https://agit.ai/Polixir/OfflineRL
typoverflow/OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
typoverflow/prose
A clean, minimalist theme featuring a light and dark mode for Ghost
typoverflow/rl-rep
Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference
typoverflow/RWKV-CUDA
The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )
typoverflow/sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
typoverflow/tdmpc2
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
typoverflow/typoverflow
typoverflow/typoverflow.github.io
typoverflow/unstable_baselines