D0z1ngShark's Stars
jaywcjlove/awesome-mac
A curated collection of premium macOS software in various categories; the project has grown well beyond its original idea.
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step
linexjlin/GPTs
Leaked prompts of GPTs
ymcui/Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
LlamaFamily/Llama-Chinese
Llama Chinese community: Llama 3 online demos and fine-tuned models are now available, with the latest Llama 3 learning resources aggregated in real time; all code has been updated for Llama 3. Building the best Chinese Llama model, fully open source and commercially usable.
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
dair-ai/ML-Papers-of-the-Week
🔥Highlighting the top ML papers every week.
mistralai/mistral-inference
Official inference library for Mistral models
huggingface/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Morizeyao/GPT2-Chinese
Chinese version of GPT2 training code, using BERT tokenizer.
ymcui/Chinese-LLaMA-Alpaca-2
Chinese LLaMA-2 & Alpaca-2 large language models (phase 2), with 64K long-context models (Chinese LLaMA-2 & Alpaca-2 LLMs)
lyogavin/airllm
AirLLM: 70B-model inference on a single 4GB GPU
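The trick behind fitting a large model on a small GPU is holding only one layer's weights in memory at a time. A minimal pure-Python sketch of that layer-streaming idea, with trivial scalar "layers" checkpointed to disk standing in for transformer blocks (all names and the JSON layout are hypothetical, not AirLLM's actual format):

```python
import json
import os
import tempfile

# Hypothetical setup: each "layer" is a scalar weight in its own file,
# standing in for a transformer block checkpointed to disk.
workdir = tempfile.mkdtemp()
n_layers = 4
for i in range(n_layers):
    with open(os.path.join(workdir, f"layer_{i}.json"), "w") as f:
        json.dump({"w": 0.5 + i}, f)

def run_layer(weights, x):
    # Stand-in for a transformer block's forward pass.
    return weights["w"] * x

def streamed_forward(x):
    """Run the full model while keeping only one layer's weights
    resident at a time, mirroring the layer-by-layer loading idea."""
    for i in range(n_layers):
        with open(os.path.join(workdir, f"layer_{i}.json")) as f:
            weights = json.load(f)   # load just this layer
        x = run_layer(weights, x)    # forward through it
        del weights                  # release before loading the next layer
    return x

y = streamed_forward(1.0)
```

Peak memory is one layer instead of the whole model; the cost is repeated disk I/O per token, which is why this approach is slow but memory-frugal.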
microsoft/LLMLingua
[EMNLP'23, ACL'24] Speeds up LLM inference and enhances LLMs' perception of key information by compressing the prompt and KV-cache, achieving up to 20x compression with minimal performance loss.
stanford-futuredata/ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Tele-AI/Telechat
XueFuzhao/OpenMoE
A family of open-source Mixture-of-Experts (MoE) large language models
OpenLMLab/MOSS-RLHF
MOSS-RLHF
charent/ChatLM-mini-Chinese
A small 0.2B-parameter Chinese dialogue model (ChatLM-Chinese-0.2B). Open-sources the full pipeline: dataset sources, data cleaning, tokenizer training, model pretraining, SFT instruction fine-tuning, RLHF optimization, and more. Supports downstream SFT fine-tuning, with a triple information-extraction fine-tuning example.
dandelionsllm/pandallm
Panda is an open-source overseas Chinese large-language-model project launched in May 2023, dedicated to exploring the full technology stack in the era of large models and to advancing innovation and collaboration in Chinese NLP.
deepseek-ai/DeepSeek-MoE
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
HIT-SCIR/Chinese-Mixtral-8x7B
Chinese Mixtral-8x7B (Chinese-Mixtral-8x7B)
AviSoori1x/makeMoE
A from-scratch implementation of a sparse mixture-of-experts language model, inspired by Andrej Karpathy's makemore :)
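The core of a sparse MoE layer is top-k gating: score every expert per input, run only the k best, and renormalize the gate over those. A pure-Python sketch of that idea (`TinyMoE` and all names are hypothetical; real experts would be neural sub-networks, not scalar weights, as in makeMoE's PyTorch version):

```python
import math
import random

random.seed(0)  # deterministic toy weights

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

class TinyMoE:
    """Toy sparse mixture of experts with scalar 'experts' (hypothetical)."""

    def __init__(self, n_experts=4, top_k=2):
        self.top_k = top_k
        self.expert_w = [random.uniform(-1, 1) for _ in range(n_experts)]
        self.gate_w = [random.uniform(-1, 1) for _ in range(n_experts)]

    def forward(self, x):
        # Gate: score every expert, keep only the top-k indices.
        logits = [g * x for g in self.gate_w]
        top = sorted(range(len(logits)), key=lambda i: logits[i],
                     reverse=True)[: self.top_k]
        # Renormalize the gate over the selected experts (softmax on top-k).
        probs = softmax([logits[i] for i in top])
        # Sparse computation: only the top-k experts run on this input.
        return sum(p * (self.expert_w[i] * x) for p, i in zip(probs, top))

moe = TinyMoE()
y = moe.forward(0.5)
```

Because only `top_k` of `n_experts` experts execute per input, compute scales with k while parameter count scales with the total number of experts, which is the appeal of sparse MoE.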
charent/Phi2-mini-Chinese
Phi2-Chinese-0.2B: train your own small Chinese Phi-2 chat model from scratch; supports LangChain integration to load a local knowledge base for retrieval-augmented generation (RAG).
GAIR-NLP/MathPile
[NeurIPS D&B 2024] Generative AI for Math: MathPile
yanqiangmiffy/how-to-train-tokenizer
How to train an LLM tokenizer
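Most LLM tokenizers are trained with byte-pair encoding (BPE): repeatedly find the most frequent adjacent symbol pair in the corpus and merge it into one symbol. A toy pure-Python sketch of that training loop (function names are hypothetical; a real tokenizer would be trained with a library such as huggingface/tokenizers):

```python
from collections import Counter

def get_pair_counts(words):
    """Count adjacent symbol pairs, weighted by word frequency."""
    counts = Counter()
    for symbols, freq in words:
        for a, b in zip(symbols, symbols[1:]):
            counts[(a, b)] += freq
    return counts

def merge_pair(words, pair):
    """Replace every occurrence of `pair` with its concatenation."""
    merged = pair[0] + pair[1]
    out = []
    for symbols, freq in words:
        new_syms, i = [], 0
        while i < len(symbols):
            if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
                new_syms.append(merged)
                i += 2
            else:
                new_syms.append(symbols[i])
                i += 1
        out.append((new_syms, freq))
    return out

def train_bpe(corpus, num_merges):
    """Learn `num_merges` BPE merges from a whitespace-split corpus."""
    word_freqs = Counter(corpus.split())
    words = [(list(w), f) for w, f in word_freqs.items()]
    merges = []
    for _ in range(num_merges):
        counts = get_pair_counts(words)
        if not counts:
            break
        best = counts.most_common(1)[0][0]  # most frequent pair
        merges.append(best)
        words = merge_pair(words, best)
    return merges

merges = train_bpe("low low low lower lowest", 3)
# On this corpus the first merges are ('l','o'), then ('lo','w'), then ('low','e').
```

The learned merge list *is* the tokenizer: encoding new text applies the same merges in order, so frequent subwords like "low" become single tokens.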