shmily326's Stars

ggerganov/llama.cpp
LLM inference in C/C++
Language: C++ · 69.2k 551 4.2k 9.9k
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language: Python · 37.2k 353 1.8k 4.6k
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Language: Python · 36k 214 5.5k 4.4k
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language: Python · 35.9k 346 2.9k 4.2k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python · 31.8k 255 5.6k 4.8k
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Language: Python · 12.4k 209 2.3k 2.6k
liguodongiot/llm-action
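This project shares the technical principles behind large models and hands-on experience with them (large-model engineering and deploying large-model applications).
Language: HTML · 11.7k 95 221.2k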
ggerganov/ggml
Tensor library for machine learning
Language: C++ · 11.4k 132 4271.1k
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Language: Python · 8.8k 77 572631
Oneflow-Inc/oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Language: C++ · 6.1k 150 1k 685
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
Language: C++ · 5.8k 109 1.2k 1k
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python · 4.8k 112 137417
ztxz16/fastllm
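A pure C++, cross-platform LLM acceleration library with Python bindings. ChatGLM-6B-class models can reach 10,000+ tokens/s on a single GPU. Supports GLM, LLaMA, and MOSS base models and runs smoothly on mobile phones.
Language: C++ · 3.3k 43 365345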
fastnlp/fastNLP
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Language: Python · 3.1k 82 218448
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
2.7k 50 3172
zjhellofss/KuiperInfer
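A good project for campus recruiting (autumn and spring hiring rounds) and internships! Implement a high-performance deep learning inference library step by step, from scratch, with support for inference on models such as Llama 2, UNet, YOLOv5, and ResNet.
Language: C++ · 2.6k 26 28298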
BBuf/how-to-optim-algorithm-in-cuda
how to optimize some algorithm in cuda.
Language: Cuda · 1.7k 26 10141
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
Language: Python · 1.3k 41 494
HuangOwen/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
1.2k 41 682
jackaduma/awesome_LLMs_interview_notes
LLMs interview notes and answers: this repository mainly collects interview questions and reference answers for large language model (LLM) algorithm engineers.
1.2k 17 6279
tjumcw/6.824
MIT 6.824 distributed system C++Version
Language: C++ · 799 1 9132
alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Language: Python · 748 10 151107
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
Language: Python · 634 7 6353
Eddie-Wang1120/HPC-Learning-Notes
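Study notes on high-performance computing, including notes and code demos for the related topics; continuously being improved. If you find it helpful, please give it a star. It helps the author a lot, thank you!
Language: Jupyter Notebook · 386 6 134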
HqWu-HITCS/Awesome-LLM-Survey
An Awesome Collection for LLM Survey
312 13 031
chenzomi12/DeepLearningSystem
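AI Infra mainly refers to the infrastructure of AI: full-stack, low-level technologies such as AI chips, AI compilers, and AI inference and training frameworks.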
186 2 13
zhenlohuang/awesome-chinese-llm
Awesome Chinese LLM: A curated list of Chinese Large Language Model resources, collecting datasets and model materials for Chinese large language models.
129 3 010
dsdanielpark/open-llm-datasets
Repository for organizing datasets and papers used in Open LLM.
92 5 06
coldlarry/llama2.cpp
Inference Llama 2 in one file of pure C
Language: C · 8 1 03
xefoci7612/baby-llama2.cpp
Inference for Llama-2 Transformer model in C/C++
Language: C++ · 3 2 00