zhaoyang02

Undergraduate student of Weiyang College, Tsinghua University.

Tsinghua University

zhaoyang02's Stars

rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
Language: Jupyter Notebook, Stars: 29.4k, Forks: 3.4k
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
Language: Python, Stars: 8.1k, Forks: 1k
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
Language:Python74263
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
Language:Python38748
openai/summarize-from-feedback
Code for "Learning to summarize from human feedback"
Language:Python983143
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language: Python, Stars: 32.5k, Forks: 4k
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python, Stars: 4.6k, Forks: 397
apple/ml-tic-clip
Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".
Language:Python936
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python, Stars: 5.5k, Forks: 623
yinyueqin/relative-preference-optimization
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
Language:Python181
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language: Python, Stars: 2.1k, Forks: 172
Atenrev/diffusion_continual_learning
PyTorch implementation of various distillation approaches for continual learning of Diffusion Models.
Language:Python161
BeyonderXX/TRACE
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models
Language:Python578
LLM-Tuning-Safety/LLMs-Finetuning-Safety
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.
Language:Python22825
UIC-Liu-Lab/ContinualLM
An Extensible Continual Learning Framework Focused on Language Models (LMs)
Language:Python24519
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Language: Python, Stars: 1.6k, Forks: 191
castorini/docTTTTTquery
docTTTTTquery document expansion model
Language:Python35434
solidsea98/Neural-Corpus-Indexer-NCI
Language:Python15120
ncbi/MedCPT
Code for MedCPT, a model for zero-shot biomedical information retrieval.
Language:Python13115
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language: Python, Stars: 7.1k, Forks: 521
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
Language: Jupyter Notebook, Stars: 1.9k, Forks: 250
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language: Python, Stars: 10.3k, Forks: 2.3k
zetaalphavector/InPars
Inquisitive Parrots for Search
Language:Python17718
allegro/allRank
allRank is a framework for training learning-to-rank neural models based on PyTorch.
Language:Python859119
codertimo/BERT-pytorch
Google AI 2018 BERT pytorch implementation
Language: Python, Stars: 6.2k, Forks: 1.3k
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language: Python, Stars: 16.1k, Forks: 1.6k
JushBJJ/Mr.-Ranedeer-AI-Tutor
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
Stars: 28.7k, Forks: 3.3k
wistbean/learn_python3_spider
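A Python web-scraping tutorial series that goes from zero to one, covering browser packet capture, mobile app packet capture (e.g. fiddler, mitmproxy), the use of modules commonly involved in scraping (requests, beautifulSoup, selenium, appium, scrapy, etc.), IP proxies, CAPTCHA recognition, using MySQL and MongoDB databases from Python, multithreaded and multiprocess scrapers, reverse engineering of CSS-based anti-scraping obfuscation, JS reverse engineering for scraping, distributed scrapers, hands-on scraper project examples, and more.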
Language: Python, Stars: 18.1k, Forks: 3.7k
openai/openai-cookbook
Examples and guides for using the OpenAI API
Language: MDX, Stars: 59.2k, Forks: 9.4k
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
Language: Python, Stars: 5.2k, Forks: 400
