lokinko

重庆大学

lokinko's Stars

ggerganov/llama.cpp
LLM inference in C/C++
Language:C++69.8k 558 4.2k10.1k
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Language:Python61.4k 431 4.1k6.6k
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language:Jupyter Notebook36.3k 394 1094.6k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python32.7k 271 5.7k5k
openai/CLIP
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Language:Jupyter Notebook26.7k 325 4053.4k
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。
17.1k 216 271.6k
systemdesign42/system-design
A resource to help you pass system design interview and become good at work 👇
13.5k 269 21.4k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.9k 106 612906
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）
Language:HTML12.2k 101 231.3k
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Language:Python10.9k 166 8112.4k
aishwaryanr/awesome-generative-ai-guide
A one stop repository for generative AI research updates, interview resources, notebooks and much more!
9.9k 338 102.1k
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python6.6k 44 83592
google/gemma.cpp
lightweight, standalone C++ inference engine for Google's Gemma models.
Language:C++6.1k 41 88518
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
3.9k 33 91175
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
3.1k 104 6207
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Language:Python2.7k 22 189217
FasterDecoding/Medusa
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Language:Jupyter Notebook2.4k 31 92164
Eladlev/AutoPrompt
A framework for prompt tuning using Intent-based Prompt Calibration
Language:Python2.3k 12 41198
BBuf/how-to-optim-algorithm-in-cuda
how to optimize some algorithm in cuda.
Language:Cuda1.8k 26 10147
datawhalechina/tiny-universe
《大模型白盒子构建指南》：一个全手搓的Tiny-Universe
Language:Python1.8k 18 14181
hyp1231/awesome-llm-powered-agent
Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
1.7k 49 9137
microsoft/ToRA
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
Language:Python1k 19 2772
NVlabs/DoRA
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation
Language:Python668 11 1844
liguodongiot/llm-resource
LLM全栈优质资源汇总
Language:Shell417 7 046
LLMServe/DistServe
Disaggregated serving system for Large Language Models (LLMs).
Language:Jupyter Notebook404 5 4550
apple/pfl-research
Simulation framework for accelerating research in Private Federated Learning
Language:Jupyter Notebook306 22 1332
wuhobin/blog-home
一个干净简洁的个人作品集合主页
Language:HTML244 2 281
galeselee/Awesome_LLM_System-PaperList
Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of papers on accelerating LLMs, currently focusing mainly on inference acceleration, and related works will be gradually added in the future. Welcome contributions!
191 4 07
TemporaryLoRA/Temp-LoRA
Language:Python91 2 57
weishengying/tiny-flash-attention
使用 cutlass 实现 flash-attention 精简版，具有教学意义
Language:Cuda32 1 04

lokinko

lokinko's Stars

ggerganov/llama.cpp

comfyanonymous/ComfyUI

rasbt/LLMs-from-scratch

vllm-project/vllm

openai/CLIP

HqWu-HITCS/Awesome-Chinese-LLM

systemdesign42/system-design

OpenBMB/MiniCPM-V

liguodongiot/llm-action

NVIDIA/Megatron-LM

aishwaryanr/awesome-generative-ai-guide

facebookresearch/DiT

google/gemma.cpp

deepseek-ai/DeepSeek-V2

DefTruth/Awesome-LLM-Inference

ModelTC/lightllm

FasterDecoding/Medusa

Eladlev/AutoPrompt

BBuf/how-to-optim-algorithm-in-cuda

datawhalechina/tiny-universe

hyp1231/awesome-llm-powered-agent

microsoft/ToRA

NVlabs/DoRA

liguodongiot/llm-resource

LLMServe/DistServe

apple/pfl-research

wuhobin/blog-home

galeselee/Awesome_LLM_System-PaperList

TemporaryLoRA/Temp-LoRA

weishengying/tiny-flash-attention