nuster1128

GSAI, Renmin University of ChinaBeijing

nuster1128's Stars

huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python133k 1.1k 15.8k26.5k
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python36.6k 348 1.8k4.5k
LC044/WeChatMsg
提取微信聊天记录，将其导出成HTML、Word、Excel文档永久保存，对聊天记录进行分析生成年度聊天报告，用聊天数据训练专属于个人的AI聊天助手
Language:Python33.6k 171 4023.5k
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language:Python29.4k 339 2684k
tloen/alpaca-lora
Instruct-tune LLaMA on consumer hardware
Language:Jupyter Notebook18.6k 153 4692.2k
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language:Python16k 107 1k1.6k
THUDM/ChatGLM3
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
Language:Python13.4k 98 7771.6k
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
Language:Jupyter Notebook11.9k 96 3401.7k
microsoft/LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Language:Python10.4k 68 105668
google/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
Language:C++10.1k 127 7451.2k
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python9.6k 74 1.1k1.2k
brightmart/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
9.4k 286 451.5k
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验。
Language:HTML9.4k 81 21917
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Language:Python7k 44 998511
yangjianxin1/Firefly
Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型
Language:Python5.7k 56 279518
wgwang/awesome-LLMs-In-China
**大模型
5.3k 106 25438
open-compass/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Language:Python3.8k 23 520406
mymusise/ChatGLM-Tuning
基于ChatGLM-6B + LoRA的Fintune方案
Language:Python3.7k 31 247440
hiyouga/ChatGLM-Efficient-Tuning
Fine-tuning ChatGLM-6B with PEFT | 基于 PEFT 的高效 ChatGLM 微调
Language:Python3.6k 32 374471
wdndev/llm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
Language:HTML2.9k 12 6336
ysymyth/ReAct
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
Language:Jupyter Notebook1.9k 16 29189
zjunlp/EasyEdit
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Language:Jupyter Notebook1.8k 22 308214
jackaduma/awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型（LLMs）算法工程师相关的面试题和参考答案
1.1k 17 6253
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
Language:Python1.1k 12 5899
AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
970 24 1182
RobustNLP/CipherChat
A framework to evaluate the generalization capability of safety alignment for LLMs
Language:Python562 9 062
alfworld/alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Language:Python341 8 7651
zhongwanjun/MemoryBank-SiliconFriend
Source code and demo for memory bank and SiliconFriend
Language:Python178 7 1724
jihoontack/MAC
Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)
Language:Python52 1 22
AngxiaoYue/awesome-llm-tool-learning
A list of awesome papers on LLM tool learning.
16 1 11

nuster1128

nuster1128's Stars

huggingface/transformers

lm-sys/FastChat

LC044/WeChatMsg

tatsu-lab/stanford_alpaca

tloen/alpaca-lora

huggingface/peft

THUDM/ChatGLM3

meta-llama/llama-recipes

microsoft/LoRA

google/sentencepiece

huggingface/trl

brightmart/nlp_chinese_corpus

liguodongiot/llm-action

FlagOpen/FlagEmbedding

yangjianxin1/Firefly

wgwang/awesome-LLMs-In-China

open-compass/opencompass

mymusise/ChatGLM-Tuning

hiyouga/ChatGLM-Efficient-Tuning

wdndev/llm_interview_note

ysymyth/ReAct

zjunlp/EasyEdit

jackaduma/awesome_LLMs_interview_notes

AGI-Edgerunners/LLM-Adapters

AIoT-MLSys-Lab/Efficient-LLMs-Survey

RobustNLP/CipherChat

alfworld/alfworld

zhongwanjun/MemoryBank-SiliconFriend

jihoontack/MAC

AngxiaoYue/awesome-llm-tool-learning