PKU-YYang's Stars
PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback
PKU-Alignment/beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
pair-lab/pair-lab.github.io
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
PKU-MARL/HARL
Official implementation of HARL algorithms based on PyTorch.
Replicable-MARL/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
upb-lea/reinforcement_learning_course_materials
Lecture notes, tutorial tasks including solutions as well as online videos for the reinforcement learning course hosted by Paderborn University
PKU-MARL/DexterousHands
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
metaopt/torchopt
TorchOpt is an efficient library for differentiable optimization built upon PyTorch.
huawei-noah/SMARTS
Scalable Multi-Agent RL Training School for Autonomous Driving
sjtu-marl/malib
A parallel framework for population-based multi-agent reinforcement learning.
diversepsro/diverse_psro
google-deepmind/pysc2
StarCraft II Learning Environment
blei-lab/edward
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
loliverhennigh/All-Convnet-Autoencoder-Example
Just a simple use example of the conv2d_transpose function in TensorFlow. Its run on MNIST.