PKU-Alignment

Loves Sharing and Open-Source, Making AI Safer.

China

Pinned Repositories

align-anything
Align Anything: Training All-modality Model with Feedback
Language:Python260 9 1946
AlignmentSurvey
AI Alignment: A Comprehensive Survey
130 3 10
beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
Language:Makefile114 5 75
omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Language:Python946 39 103132
ProAgent
ProAgent: Building Proactive Cooperative Agents with Large Language Models
Language:JavaScript62 10 17
Safe-Policy-Optimization
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
Language:Python331 8 1045
safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language:Python1.4k 18 85120
safe-sora
SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enhance the helpfulness and harmlessness of Large Vision Models (LVMs).
Language:Python26 4 15
SafeDreamer
ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models
Language:Python49 4 37
safety-gymnasium
NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
Language:Python402 10 2752

PKU-Alignment's Repositories

PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language:Python1.4k 18 85120
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Language:Python946 39 103132
PKU-Alignment/safety-gymnasium
NeurIPS 2023: Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
Language:Python402 10 2752
PKU-Alignment/Safe-Policy-Optimization
NeurIPS 2023: Safe Policy Optimization: A benchmark repository for safe reinforcement learning algorithms
Language:Python331 8 1045
PKU-Alignment/align-anything
Align Anything: Training All-modality Model with Feedback
Language:Python260 9 1946
PKU-Alignment/AlignmentSurvey
AI Alignment: A Comprehensive Survey
130 3 10
PKU-Alignment/beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
Language:Makefile114 5 75
PKU-Alignment/ProAgent
ProAgent: Building Proactive Cooperative Agents with Large Language Models
Language:JavaScript62 10 17
PKU-Alignment/SafeDreamer
ICLR 2024: SafeDreamer: Safe Reinforcement Learning with World Models
Language:Python49 4 37
PKU-Alignment/safe-sora
SafeSora is a human preference dataset designed to support safety alignment research in the text-to-video generation field, aiming to enhance the helpfulness and harmlessness of Large Vision Models (LVMs).
Language:Python26 4 15
PKU-Alignment/ReDMan
ReDMan is an open-source simulation platform that provides a standardized implementation of safe RL algorithms for Reliable Dexterous Manipulation.
Language:Python16 3 02
PKU-Alignment/ProgressGym
Alignment with a millennium of moral progress. Spotlight@NeurIPS 2024 Track on Datasets and Benchmarks.
Language:Python10 1 02
PKU-Alignment/llms-resist-alignment
Repo for paper "Language Models Resist Alignment"
Language:Python4 2 00
PKU-Alignment/.github
0 2 00
PKU-Alignment/aligner
Achieving Efficient Alignment through Learned Correction
PKU-Alignment/Aligner2024.github.io
Language:HTML

PKU-Alignment

Pinned Repositories

align-anything

AlignmentSurvey

beavertails

omnisafe

ProAgent

Safe-Policy-Optimization

safe-rlhf

safe-sora

SafeDreamer

safety-gymnasium

PKU-Alignment's Repositories

PKU-Alignment/safe-rlhf

PKU-Alignment/omnisafe

PKU-Alignment/safety-gymnasium

PKU-Alignment/Safe-Policy-Optimization

PKU-Alignment/align-anything

PKU-Alignment/AlignmentSurvey

PKU-Alignment/beavertails

PKU-Alignment/ProAgent

PKU-Alignment/SafeDreamer

PKU-Alignment/safe-sora

PKU-Alignment/ReDMan

PKU-Alignment/ProgressGym

PKU-Alignment/llms-resist-alignment

PKU-Alignment/.github

PKU-Alignment/aligner

PKU-Alignment/Aligner2024.github.io