cby-pku
Sophomore undergrad at Peking University📚 Focus on Scalable Oversight / AI Safety / AI Alignment
Peking UniversityBeijing
Pinned Repositories
align-anything
aligner
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
DeepSpeed-Chat
gpt4_eval
GPT-4 evaluation prompt, accelerated with ray.
pku_ai_basis
The basic demo of the ai_basic_learing_2023_spring_pku
pku_cv_2024
Early computer and mid-level vision
pku_dsa
A review of my code practice when learning pku : data structure and algorithm
pku_ics
A review of my code lab when learning pku : ICS
pku_machine_learning_2023
PKU 2023 -12 Machine Learning Labs
pku_modeling
the demo of jiangzehan_modeling
cby-pku's Repositories
cby-pku/aligner
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
cby-pku/pku_machine_learning_2023
PKU 2023 -12 Machine Learning Labs
cby-pku/pku_ai_basis
The basic demo of the ai_basic_learing_2023_spring_pku
cby-pku/DeepSpeed-Chat
cby-pku/gpt4_eval
GPT-4 evaluation prompt, accelerated with ray.
cby-pku/pku_dsa
A review of my code practice when learning pku : data structure and algorithm
cby-pku/pku_ics
A review of my code lab when learning pku : ICS
cby-pku/pku_modeling
the demo of jiangzehan_modeling
cby-pku/pku_nlp_2024
cby-pku/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
cby-pku/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
cby-pku/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
cby-pku/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
cby-pku/align-anything
cby-pku/cby-pku.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
cby-pku/malib
A parallel framework for population-based multi-agent reinforcement learning.
cby-pku/MARLlib
One repository is all that is necessary for Multi-agent Reinforcement Learning (MARL)
cby-pku/omnisafe
OmniSafe is an infrastructural framework for accelerating SafeRL research.
cby-pku/Safe-Policy-Optimization
This is a benchmark repository for safe reinforcement learning algorithms
cby-pku/CookBook
🎉🎉🎉JAVA高级架构师技术栈==任何技能通过 “刻意练习” 都可以达到融会贯通的境界,就像烹饪一样,这里有一份JAVA开发技术手册,只需要增加自己练习的次数。🏃🏃🏃
cby-pku/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
cby-pku/markdown-emoji
Markdown语法支持添加 emoji表情,输入不同的符号码(两个冒号包围的字符)可以显示出不同的表情
cby-pku/OI-wiki
:star2: Wiki of OI / ICPC for everyone. (某大型游戏线上攻略,内含炫酷算术魔法)
cby-pku/pku_ai_society
Code for PKU AI Social Sciences
cby-pku/pku_programming
A review of my code when learning PKU: programming-algorithm
cby-pku/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs