exlaw
Aiwei Liu, a Ph.D. student at Tsinghua University focusing on Natural Language Processing
Tsinghua University, Beijing, China
exlaw's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods covering single- and multi-node GPU setups. Supports default and custom datasets for applications such as summarization and Q&A. Also supports several inference solutions, such as HF TGI and vLLM, for local or cloud deployment, and includes demo apps showcasing Meta Llama for WhatsApp & Messenger.
huggingface/trl
Train transformer language models with reinforcement learning.
EleutherAI/lm-evaluation-harness
A framework for few-shot evaluation of language models.
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
Niek/chatgpt-web
ChatGPT web interface using the OpenAI API
yule-BUAA/MergeLM
Codebase for Merging Language Models (ICML 2024)
sail-sg/lorahub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
THUYimingLi/BackdoorBox
The open-sourced Python toolbox for backdoor attacks and defenses.
THU-BPM/MarkLLM
MarkLLM: An Open-Source Toolkit for LLM Watermarking (EMNLP 2024 Demo)
chrisliu298/awesome-llm-unlearning
A resource repository for machine unlearning in large language models
Ablustrund/LoRAMoE
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
Libr-AI/do-not-answer
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs
ICTMCG/LLM-for-misinformation-research
Paper list of misinformation research using (multi-modal) large language models, i.e., (M)LLMs.
uukuguy/multi_loras
Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answer based on user queries.
andyrdt/refusal_direction
Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction".
wuhy68/Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
ericwtodd/function_vectors
Function Vectors in Large Language Models (ICLR 2024)
GCYZSL/MoLA
PKU-Alignment/beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
TUDB-Labs/MixLoRA
State-of-the-art Parameter-Efficient MoE Fine-tuning Method
LucyDYu/Awesome-Multimodal-Continual-Learning
chrisliu298/awesome-representation-engineering
A resource repository for representation engineering in large language models
vinusankars/Reliability-of-AI-text-detectors
Can AI-Generated Text be Reliably Detected?
rishub-tamirisa/tamper-resistance
Official Repository for "Tamper-Resistant Safeguards for Open-Weight LLMs"
ydyjya/LLM-IHS-Explanation
THU-BPM/WaterSeeker
WaterSeeker: Pioneering Efficient Detection of Watermarked Segments in Large Documents
LeiLiLab/llm_watermark_tutorial
survey-text-watermark/survey-text-watermark.github.io
exlaw/DLMA
Code and data for the paper "Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation" (ACL 2024).