yaof20's Stars
dair-ai/Prompt-Engineering-Guide
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
openai/chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods, covering single- and multi-node GPU setups. Supports default & custom datasets for applications such as summarization and Q&A. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Includes demo apps showcasing Meta Llama for WhatsApp & Messenger.
Stability-AI/StableLM
StableLM: Stability AI Language Models
huggingface/trl
Train transformer language models with reinforcement learning.
artidoro/qlora
QLoRA: Efficient Finetuning of Quantized LLMs
huggingface/text-generation-inference
Large Language Model Text Generation Inference
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
arcee-ai/mergekit
Tools for merging pretrained large language models.
Facico/Chinese-Vicuna
Chinese-Vicuna: A Chinese instruction-following LLaMA-based model — a low-resource Chinese LLaMA+LoRA approach, with a structure based on Alpaca
yuanzhoulvpi2017/zero_nlp
Chinese NLP solutions (large models, data, models, training, inference)
zjunlp/LLMAgentPapers
Must-read Papers on LLM Agents.
microsoft/DeepSpeed-MII
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
gpu-mode/resource-stream
GPU programming related news and material links
AGI-Edgerunners/LLM-Agents-Papers
A repo listing papers related to LLM-based agents
srush/awesome-o1
A bibliography and survey of the papers surrounding o1
hollobit/GenAI_LLM_timeline
ChatGPT, GenerativeAI and LLMs Timeline
bigcode-project/bigcode-evaluation-harness
A framework for the evaluation of autoregressive code generation language models.
SinclairCoder/Instruction-Tuning-Papers
A reading list on instruction tuning, a trend that started with Natural Instructions (ACL 2022), FLAN (ICLR 2022) and T0 (ICLR 2022).
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
Jack47/hack-SysML
The road to hacking SysML and becoming a systems expert
microsoft/GRIN-MoE
GRadient-INformed MoE
Hannibal046/nanoRWKV
The nanoGPT-style implementation of RWKV Language Model - an RNN with GPT-level LLM performance.
luban-agi/Awesome-Tool-Learning
A curated list of papers and applications on tool learning.
RZFan525/Awesome-ScalingLaws
A curated list of awesome resources dedicated to Scaling Laws for LLMs
THUlawtech/MUSER
kyegomez/FlashLora
FlashAttention 2.0 with LoRA