PKU-YYang

PKU-YYang's Stars

google-deepmind/pysc2
StarCraft II Learning Environment
Language:Python8k 347 2821.2k
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Language:Python5.7k 68 128507
blei-lab/edward
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
Language:Jupyter Notebook4.8k 272 514758
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language:Python1.4k 18 88119
huawei-noah/SMARTS
Scalable Multi-Agent RL Training School for Autonomous Driving
Language:Python977 14 1k192
Replicable-MARL/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
Language:Python975 11 156160
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Language:Python964 40 108132
upb-lea/reinforcement_learning_course_materials
Lecture notes, tutorial tasks including solutions as well as online videos for the reinforcement learning course hosted by Paderborn University
Language:Jupyter Notebook954 31 15217
PKU-MARL/DexterousHands
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
Language:Python687 15 4484
PKU-MARL/HARL
Official implementation of HARL algorithms based on PyTorch.
Language:Python561 9 5266
metaopt/torchopt
TorchOpt is an efficient library for differentiable optimization built upon PyTorch.
Language:Python558 14 3935
sjtu-marl/malib
A parallel framework for population-based multi-agent reinforcement learning.
Language:Python512 10 3662
PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback
Language:Python337 11 2065
PKU-Alignment/beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
Language:Makefile119 6 75
loliverhennigh/All-Convnet-Autoencoder-Example
Just a simple use example of the conv2d_transpose function in TensorFlow. Its run on MNIST.
Language:Python22 4 312
diversepsro/diverse_psro
Language:Python18 1 18
pair-lab/pair-lab.github.io
Language:TeX10