yty3805595's Stars
meta-llama/llama3
The official Meta Llama 3 GitHub site
unslothai/unsloth
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
QwenLM/Qwen
The official repo of Qwen (通义千问), the chat & pretrained large language models proposed by Alibaba Cloud.
openai/tiktoken
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
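tiktoken implements byte-pair encoding (BPE). As a toy illustration of the core idea — repeatedly merging the most frequent adjacent token pair — here is a minimal sketch in plain Python; it is not tiktoken's API.

```python
from collections import Counter

def bpe_merge_step(tokens):
    """One toy BPE step: find the most frequent adjacent pair and merge
    every occurrence. Illustration only, not tiktoken's implementation."""
    pairs = Counter(zip(tokens, tokens[1:]))
    if not pairs:
        return tokens, None
    best = max(pairs, key=pairs.get)
    merged, i = [], 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == best:
            merged.append(tokens[i] + tokens[i + 1])
            i += 2
        else:
            merged.append(tokens[i])
            i += 1
    return merged, best

tokens, pair = bpe_merge_step(list("aaabdaaabac"))
# merges the most frequent pair ('a', 'a') wherever it occurs
```

Real tokenisers precompute thousands of such merges once on a training corpus, then apply them in rank order at encode time.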
netease-youdao/QAnything
Question and Answer based on Anything.
SJTU-IPADS/PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
QwenLM/Qwen2
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
ymcui/Chinese-LLaMA-Alpaca-2
Chinese LLaMA-2 & Alpaca-2 large models (phase-2 project), including 64K extra-long-context models (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)
FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
modelscope/swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Tongji-KGLLM/RAG-Survey
netease-youdao/BCEmbedding
Netease Youdao's open-source embedding and reranker models for RAG products.
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
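Context-window extension methods like YaRN operate on RoPE's rotary frequencies. As a hedged sketch of the simplest baseline — plain position interpolation, which YaRN refines with per-frequency scaling and an attention temperature — under standard RoPE conventions (this is not the yarn repo's code):

```python
def rope_frequencies(dim, base=10000.0, scale=1.0):
    """Standard RoPE inverse frequencies for a head dimension `dim`.
    Dividing every frequency by `scale` is equivalent to dividing positions
    by `scale`: plain position interpolation, the baseline YaRN improves on.
    Sketch only, not the jquesnelle/yarn implementation."""
    return [1.0 / (base ** (2 * i / dim)) / scale for i in range(dim // 2)]

freqs = rope_frequencies(8, scale=4.0)  # 4x context extension via interpolation
```

YaRN's contribution is that high frequencies (local information) should be interpolated less aggressively than low frequencies, rather than scaling all of them uniformly as above.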
peremartra/Large-Language-Model-Notebooks-Course
Practical course about Large Language Models.
thunlp/WebCPM
Official codes for ACL 2023 paper "WebCPM: Interactive Web Search for Chinese Long-form Question Answering"
pjlab-sys4nlp/llama-moe
⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training
ymcui/Chinese-Mixtral
Chinese Mixtral mixture-of-experts large models (Chinese Mixtral MoE LLMs)
AviSoori1x/makeMoE
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
zhuzilin/ring-flash-attention
Ring attention implementation with flash attention
BeachWang/DAIL-SQL
An efficient and effective few-shot NL2SQL method on GPT-4.
lucidrains/local-attention
An implementation of local windowed attention for language modeling
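Local windowed attention restricts each query to a fixed-size neighbourhood of keys instead of the full sequence. A minimal sketch of a causal sliding-window mask (the idea, not lucidrains' API):

```python
def local_attention_mask(seq_len, window):
    """Boolean mask where query i may attend to keys j in
    [max(0, i - window + 1), i]: a causal sliding window.
    Illustration of local attention, not the local-attention repo's code."""
    return [[max(0, i - window + 1) <= j <= i for j in range(seq_len)]
            for i in range(seq_len)]

mask = local_attention_mask(5, 2)  # each position sees itself and one predecessor
```

This drops attention cost from O(n^2) to O(n * window) while keeping causality.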
OpenLMLab/LEval
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
X-PLUG/ChatPLUG
A Chinese Open-Domain Dialogue System
lucidrains/st-moe-pytorch
Implementation of ST-MoE, the latest incarnation of mixture-of-experts after years of research at Google Brain, in PyTorch
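The core of a sparse MoE layer is top-k gating: each token is routed to its k highest-scoring experts with softmax-renormalised weights. A toy sketch of top-2 routing (illustrative only, not the st-moe-pytorch implementation):

```python
import math

def top2_gate(logits):
    """Route one token to its top-2 experts, with gate weights obtained by
    softmax-renormalising the two winning logits. Toy sketch of top-k MoE
    gating, not st-moe-pytorch's code (which adds z-loss and balance loss)."""
    top = sorted(range(len(logits)), key=lambda e: logits[e], reverse=True)[:2]
    exp = [math.exp(logits[e]) for e in top]
    total = sum(exp)
    return [(e, w / total) for e, w in zip(top, exp)]

routes = top2_gate([0.1, 2.0, -1.0, 1.0])  # experts 1 and 3 receive the token
```

Production implementations add auxiliary losses (ST-MoE's router z-loss, load-balancing loss) so that tokens spread evenly across experts.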
shmsw25/FActScore
A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"
AutoLLM/AutoAgents
Complex question answering in LLMs with enhanced reasoning and information-seeking capabilities.
asahi417/lmppl
Calculate perplexity on a text with pre-trained language models. Supports MLMs (e.g. DeBERTa), causal LMs (e.g. GPT-3), and encoder-decoder LMs (e.g. Flan-T5).
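Perplexity is the exponential of the average negative log-probability the model assigns to each token. A minimal sketch of the definition such a tool computes (not lmppl's API):

```python
import math

def perplexity(token_logprobs):
    """Perplexity = exp(-mean log-probability) over the scored tokens.
    Sketch of the standard definition, not lmppl's implementation."""
    return math.exp(-sum(token_logprobs) / len(token_logprobs))

ppl = perplexity([math.log(0.25)] * 4)
# a uniform per-token probability of 0.25 gives perplexity 4.0
```

Lower perplexity means the model found the text more predictable; a perplexity of k roughly means the model was as uncertain as choosing uniformly among k tokens at each step.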
fkodom/grouped-query-attention-pytorch
(Unofficial) PyTorch implementation of grouped-query attention (GQA) from "GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints" (https://arxiv.org/pdf/2305.13245.pdf)
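Grouped-query attention reduces the KV cache by letting several query heads share one key/value head. A sketch of the grouping rule from the GQA paper (not fkodom's code):

```python
def kv_head_for_query(q_head, num_q_heads, num_kv_heads):
    """In grouped-query attention, consecutive query heads share a KV head:
    with 8 query heads and 2 KV heads, query heads 0-3 map to KV head 0 and
    4-7 to KV head 1. Sketch of the grouping rule, not the repo's code."""
    group_size = num_q_heads // num_kv_heads
    return q_head // group_size

mapping = [kv_head_for_query(h, 8, 2) for h in range(8)]
# → [0, 0, 0, 0, 1, 1, 1, 1]
```

Setting num_kv_heads = num_q_heads recovers standard multi-head attention; num_kv_heads = 1 recovers multi-query attention, with GQA interpolating between the two.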
luchangli03/export_llama_to_onnx
Export Llama models to ONNX.