zhaoyang02's Stars
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward models for RLHF.
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
openai/summarize-from-feedback
Code for "Learning to summarize from human feedback"
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
apple/ml-tic-clip
Repository for the paper: "TiC-CLIP: Continual Training of CLIP Models".
vwxyzjn/cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
yinyueqin/relative-preference-optimization
Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Atenrev/diffusion_continual_learning
PyTorch implementation of various distillation approaches for continual learning of Diffusion Models.
BeyonderXX/TRACE
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models
LLM-Tuning-Safety/LLMs-Finetuning-Safety
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.
UIC-Liu-Lab/ContinualLM
An Extensible Continual Learning Framework Focused on Language Models (LMs)
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use: evaluate your models across 15+ diverse IR datasets.
castorini/docTTTTTquery
docTTTTTquery document expansion model
solidsea98/Neural-Corpus-Indexer-NCI
ncbi/MedCPT
Code for MedCPT, a model for zero-shot biomedical information retrieval.
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
zetaalphavector/InPars
Inquisitive Parrots for Search
allegro/allRank
allRank is a framework for training learning-to-rank neural models based on PyTorch.
codertimo/BERT-pytorch
PyTorch implementation of Google AI's 2018 BERT
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
JushBJJ/Mr.-Ranedeer-AI-Tutor
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
wistbean/learn_python3_spider
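A Python web-scraping tutorial series for learning scraping from scratch. It covers browser packet capture; mobile app packet capture with tools such as fiddler and mitmproxy; the modules used in scraping, such as requests, beautifulSoup, selenium, appium, and scrapy; IP proxies; CAPTCHA recognition; using MySQL and MongoDB from Python; multithreaded and multiprocess scrapers; reverse engineering of CSS-based anti-scraping encryption; JS reverse engineering for scraping; distributed scrapers; and hands-on scraping projects.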
openai/openai-cookbook
Examples and guides for using the OpenAI API
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data