zmsn-2077
Ph.D. student at Peking University. Interested in Coding & LLM (Safe) Alignment, @PKU-Alignment.
Peking University, Beijing
Pinned Repositories
omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Safe-Policy-Optimization
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
DexterousHands
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
align-anything
Align Anything: Training All-modality Model with Feedback
CUP-safe-rl
NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization
Dev-Setup-Jiaming
Automation scripts for setting up a basic development environment.
omnisafe_zmsn
OmniSafe is a comprehensive and reliable benchmark for safe reinforcement learning.
RLHFTest
Safe-Policy-Optimization
This is a benchmark repository for safe reinforcement learning algorithms
zmsn-2077's Repositories
zmsn-2077/CUP-safe-rl
NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization
zmsn-2077/Dev-Setup-Jiaming
Automation scripts for setting up a basic development environment.
zmsn-2077/omnisafe_zmsn
OmniSafe is a comprehensive and reliable benchmark for safe reinforcement learning.
zmsn-2077/RLHFTest
zmsn-2077/Safe-Policy-Optimization
This is a benchmark repository for safe reinforcement learning algorithms
zmsn-2077/align-anything
Align Anything: Training All-modality Model with Feedback
zmsn-2077/baichuan-7B
A large-scale 7B pretraining language model developed by Baichuan
zmsn-2077/draggable-example
vue.draggable example
zmsn-2077/functorch
functorch is JAX-like composable function transforms for PyTorch.
zmsn-2077/Gymnasium
A standard API for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
zmsn-2077/RRHF
RRHF & Wombat
zmsn-2077/safe-rlhf-dev
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
zmsn-2077/safety-gymnasium-zmsn
Safety-Gymnaisum is a highly scalable and customizable safe reinforcement learning environment library.
zmsn-2077/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
zmsn-2077/starter-hugo-research-group
zmsn-2077/tianshou
An elegant PyTorch deep reinforcement learning library.
zmsn-2077/tldr
📚 Collaborative cheatsheets for console commands
zmsn-2077/torchopt
TorchOpt is an efficient library for differentiable optimization built upon PyTorch.