zhangyuansen's Stars
moymix/TaskMatrix
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
microsoft/DeepSpeedExamples
Example models using DeepSpeed
llm-attacks/llm-attacks
Universal and Transferable Attacks on Aligned Language Models
modelscope/swift
ms-swift: Use PEFT or full-parameter training to fine-tune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
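As a rough illustration of the objective this repository implements, the per-pair DPO loss can be sketched as below; this is a minimal standalone version of the published formula, not the repository's actual code, and the argument names are my own.

```python
import math

def dpo_loss(pi_logp_w, pi_logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """DPO loss for one preference pair (Rafailov et al., 2023).

    pi_logp_*  : log-prob of the chosen (w) / rejected (l) response under the policy
    ref_logp_* : same log-probs under the frozen reference model
    Returns -log sigmoid(beta * (policy margin - reference margin)).
    """
    logits = beta * ((pi_logp_w - ref_logp_w) - (pi_logp_l - ref_logp_l))
    return -math.log(1.0 / (1.0 + math.exp(-logits)))
```

When the policy matches the reference model the margin is zero, so the loss starts at log 2 and decreases as the policy widens the gap between chosen and rejected responses.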
Timothyxxx/Chain-of-ThoughtsPapers
A paper list tracing the trend started by "Chain of Thought Prompting Elicits Reasoning in Large Language Models".
zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
yechens/NL2SQL
A curated collection of Text2SQL (semantic parsing) datasets, solutions, and papers
zixian2021/AI-interview-cards
The most complete repository of AI algorithm interview questions: 1,000 questions across 25 categories
yule-BUAA/MergeLM
Codebase for Merging Language Models (ICML 2024)
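A common building block in this line of model-merging work is task-vector arithmetic: subtract the base weights from each fine-tuned model and add a scaled sum of the deltas back. The sketch below shows the idea on plain parameter dictionaries; it is an assumption-laden illustration of the general technique, not MergeLM's specific method.

```python
def task_arithmetic_merge(base, finetuned_list, coef=1.0):
    """Merge fine-tuned models via task vectors.

    base           : dict mapping parameter name -> weight (scalars here for clarity;
                     real implementations use tensors)
    finetuned_list : list of dicts with the same keys as `base`
    Returns base + coef * sum(finetuned - base), parameter by parameter.
    """
    merged = {}
    for name, w in base.items():
        delta = sum(ft[name] - w for ft in finetuned_list)
        merged[name] = w + coef * delta
    return merged
```

With `coef=1.0` and a single fine-tuned model this reduces to just keeping that model's weights; smaller coefficients interpolate between the base and the combined task vectors.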
ucinlp/autoprompt
AutoPrompt: Automatic Prompt Construction for Masked Language Models.
facebookresearch/GradientEpisodicMemory
Continuum Learning with GEM: Gradient Episodic Memory
BeyonderXX/InstructUIE
Universal information extraction with instruction learning
JH-LEE-KR/l2p-pytorch
PyTorch Implementation of Learning to Prompt (L2P) for Continual Learning @ CVPR22
yunqing-me/AttackVLM
[NeurIPS 2023] On Evaluating Adversarial Robustness of Large Vision-Language Models
thunlp/OpenBackdoor
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
cmnfriend/O-LoRA
neural-dialogue-metrics/Distinct-N
Compute Distinct-N metric proposed by Jiwei Li et al.
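The Distinct-N metric itself is simple: the fraction of n-grams in a generated text that are unique. A minimal sketch, independent of this repository's implementation:

```python
def distinct_n(tokens, n):
    """Distinct-N (Li et al., 2016): unique n-grams / total n-grams.

    tokens : list of tokens from a generated response
    n      : n-gram order (1 for Distinct-1, 2 for Distinct-2, ...)
    Higher values indicate more lexically diverse output.
    """
    if len(tokens) < n:
        return 0.0
    ngrams = [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    return len(set(ngrams)) / len(ngrams)
```

For example, the token sequence `a b a b` has three bigrams, two of them distinct, giving Distinct-2 = 2/3.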
arazd/ProgressivePrompts
Progressive Prompts: Continual Learning for Language Models
reasoning-machines/CoCoGen
Language Models of Code are Few-Shot Commonsense Learners (EMNLP 2022)
hkust-nlp/PEM_composition
[NeurIPS 2023] Github repository for "Composing Parameter-Efficient Modules with Arithmetic Operations"
BeyonderXX/TRACE
TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models
h3lio5/episodic-lifelong-learning
Implementation of "Episodic Memory in Lifelong Language Learning" (NeurIPS 2019) in PyTorch
SALT-NLP/IDBR
Codes for the paper: "Continual Learning for Text Classification with Information Disentanglement Based Regularization"
MehdiAbbanaBennani/continual-learning-ogdplus
Code for "Generalisation Guarantees for Continual Learning with Orthogonal Gradient Descent" (ICML 2020 - Lifelong Learning Workshop)
cambridgeltl/ClaPS
Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning (Zhou et al.; EMNLP 2023 Findings)
meryemmhamdi1/x-continuous-learning
Analyzes and visualizes the parameters and components of cross-lingual downstream-task architectures to detect and mitigate catastrophic forgetting.