typoverflow

Second-year M.Sc student @ School of AI, Nanjing University, focusing on reinforcement learning.

Nanjing UniversityNanjing

Pinned Repositories

ACT
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
Language:Python9 0 03
OfflineRL-Lib
Benchmarked implementations of Offline RL Algorithms.
Language:Python62 3 47
.dotfiles
A repo containing bash scripts to deploy reinforcement learning dev environment within one click!
Language:Shell8 2 02
chainy-zsh-theme
Chainy Theme for Oh My ZSH
Language:Shell6 1 02
note
notes for NJU courses
Language:TeX18 1 05
Pirror
基于树莓派（Pi）和PyGame的魔镜（Mirror）
Language:Python16 3 23
pytorch-crf
条件随机场（CRF）的pytorch实现
Language:Python9 1 00
UtilsRL
A python module designed for agile RL algorithm developing.
Language:Python26 4 193
WiseRL
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
Language:Python151
unstable_baselines
Re-implementations of SOTA RL algorithms.
Language:Python127 4 712

typoverflow's Repositories

typoverflow/UtilsRL
A python module designed for agile RL algorithm developing.
Language:Python26 4 193
typoverflow/note
notes for NJU courses
Language:TeX18 1 05
typoverflow/Pirror
基于树莓派（Pi）和PyGame的魔镜（Mirror）
Language:Python16 3 23
typoverflow/WiseRL
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
Language:Python151
typoverflow/.dotfiles
A repo containing bash scripts to deploy reinforcement learning dev environment within one click!
Language:Shell8 2 02
typoverflow/chainy-zsh-theme
Chainy Theme for Oh My ZSH
Language:Shell6 1 02
typoverflow/Decision-RWKV
Preliminary attempt to use RWKV to achieve infinite context length for decision-making.
Language:Python4 1 01
typoverflow/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language:Python1 0 0
typoverflow/sbx
SBX: Stable Baselines Jax (SB3 + Jax)
Language:Python1 0 0
typoverflow/sync-issue
使用Github Action同步nju.git和github仓库之间的issues。
Language:Python1 1 120
typoverflow/.tmux
🇫🇷 Oh my tmux! My self-contained, pretty & versatile tmux configuration made with ❤️
0 0
typoverflow/alphastar
Language:Python0 0
typoverflow/ChatGPT-Next-Web
One-Click to deploy well-designed ChatGPT web UI on Vercel. 一键拥有你自己的 ChatGPT 网页服务。
Language:TypeScript0 0
typoverflow/CleanDiffuser
CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making
typoverflow/CODAS
The Official Code for Cross-Modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning
Language:Python0 0
typoverflow/CORL
High-quality single-file implementations of SOTA Offline RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC
Language:Python0 0
typoverflow/hok_env
Language:Python0 0
typoverflow/meta_rl
Meta RL codebase for Unstable Baselines
Language:Python0 0
typoverflow/mirai-bot-manager
Language:Kotlin1 0
typoverflow/MOSS-RLHF
MOSS-RLHF
Language:Python0 0
typoverflow/OfflineRL
A collection of offline reinforcement learning algorithms. This is a mirror repo from https://agit.ai/Polixir/OfflineRL
Language:Python0 0
typoverflow/OfflineRL-Kit
An elegant PyTorch offline reinforcement learning library for researchers.
Language:Python0 0
typoverflow/prose
A clean, minimalist theme featuring a light and dark mode for Ghost
Language:CSS0 0
typoverflow/rl-rep
Representation Learning (RepL) Methods in Reinforcement Learning and Causal Inference
Language:Python0 0
typoverflow/RWKV-CUDA
The CUDA version of the RWKV language model ( https://github.com/BlinkDL/RWKV-LM )
Language:Cuda0 0
typoverflow/sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
Language:Python0 0
typoverflow/tdmpc2
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
Language:Python0 0
typoverflow/typoverflow
Language:Python1 01
typoverflow/typoverflow.github.io
Language:SCSS
typoverflow/unstable_baselines
Language:Python0 0