shmily326's Stars
ggerganov/llama.cpp
LLM inference in C/C++
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
ggerganov/ggml
Tensor library for machine learning
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
Oneflow-Inc/oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
ztxz16/fastllm
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
fastnlp/fastNLP
fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.
Zjh-819/LLMDataHub
A quick guide (especially) for trending instruction finetuning datasets
zjhellofss/KuiperInfer
校招、秋招、春招、实习好项目!带你从零实现一个高性能的深度学习推理库,支持大模型 llama2 、Unet、Yolov5、Resnet等模型的推理。Implement a high-performance deep learning inference library step by step
BBuf/how-to-optim-algorithm-in-cuda
how to optimize some algorithm in cuda.
horseee/Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
HuangOwen/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
jackaduma/awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
tjumcw/6.824
MIT 6.824 distributed system C++Version
alibaba/Pai-Megatron-Patch
The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
alibaba/Megatron-LLaMA
Best practice for training LLaMA models in Megatron-LM
Eddie-Wang1120/HPC-Learning-Notes
高性能计算相关知识学习笔记,包含学习笔记和相关知识的代码demo,在持续完善中。 如果有帮助的话请Star一下,对作者帮助很大,谢谢!
HqWu-HITCS/Awesome-LLM-Survey
An Awesome Collection for LLM Survey
chenzomi12/DeepLearningSystem
AI Infra主要是指AI的基础建设,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术。
zhenlohuang/awesome-chinese-llm
Awesome Chinese LLM: A curated list of Chinese Large Language Model 中文大语言模型数据集和模型资料汇总
dsdanielpark/open-llm-datasets
Repository for organizing datasets and papers used in Open LLM.
coldlarry/llama2.cpp
Inference Llama 2 in one file of pure C
xefoci7612/baby-llama2.cpp
Inference for Llama-2 Transformer model in C/C++