inkinworld

Try Try Try

HangZhou

inkinworld's Stars

All-Hands-AI/OpenHands
🙌 OpenHands: Code Less, Make More
Language:Python31.3k 285 1.3k3.6k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python26.8k 223 4.4k3.9k
iovisor/bcc
BCC - Tools for BPF-based Linux IO analysis, networking, monitoring, and more
Language:C20.3k 557 1.9k3.8k
samber/lo
💥 A Lodash-style Go library based on Go 1.18+ Generics (map, filter, contains, find...)
Language:Go17.4k 77 208799
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
Language:Python9k 83 36830
huggingface/text-generation-inference
Large Language Model Text Generation Inference
Language:Python8.8k 99 1.3k1k
THUDM/CodeGeeX2
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Language:Python7.6k 65 248532
harvardnlp/annotated-transformer
An annotated implementation of the Transformer paper.
Language:Jupyter Notebook5.6k 65 881.2k
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python5.1k 52 511359
kf-liu/The-Art-of-Linear-Algebra-zh-CN
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone", 线性代数的艺术中文版, 欢迎PR.
Language:PostScript4.4k 40 0445
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language:Python4.2k 35 1.3k376
bbycroft/llm-viz
3D Visualization of an GPT-style LLM
Language:TypeScript3.8k 32 14419
Tencent/cherry-markdown
✨ A Markdown Editor
Language:JavaScript3.5k 46 495412
huggingface/text-embeddings-inference
A blazing fast inference solution for text embeddings models
Language:Rust2.6k 33 240161
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM Inference Paper with codes, TensorRT-LLM, vLLM, streaming-llm, AWQ, SmoothQuant, WINT8/4, Continuous Batching, FlashAttention, PagedAttention etc.
2.5k 81 6162
mengjian-github/copilot-analysis
Language:JavaScript2k 21 7239
ZiyaoGeng/RecLearn
Recommender Learning with Tensorflow2.x
Language:Python1.8k 35 82492
BBuf/how-to-optim-algorithm-in-cuda
how to optimize some algorithm in cuda.
Language:Cuda1.4k 21 9118
HeKun-NVIDIA/CUDA-Programming-Guide-in-Chinese
This is a Chinese translation of the CUDA programming guide
1.2k 37 1185
mingrammer/flog
:tophat: A fake log generator for common log formats
Language:Go1.1k 8 43133
triton-inference-server/pytriton
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
Language:Python715 18 7348
IST-DASLab/marlin
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.
Language:Python560 15 2745
zhaozhiyong19890102/Recommender-System
推荐系统综述
437 14 059
zc911/MatrixSlow
A simple deep learning framework in pure python for purpose of learning in DL
Language:TypeScript421 11 786
hellotransformers/Natural_Language_Processing_with_Transformers
Natural Language Processing with Transformers 中译本，最权威Transformers教程
363 4 289
charlotteLive/pybind11-Chinese-docs
pybind11中文文档（个人翻译）
246 2 258
jy-yuan/KIVI
KIVI: A Tuning-Free Asymmetric 2bit Quantization for KV Cache
Language:Python213 5 2419
Nanbeige/Nanbeige
Language:Python84 2 39
KyleBing/map
路书，路线规划，高德地图 api 示例，地图信息 vue3 ts vite
Language:Vue81 3 718
run-ai/llmperf
Language:C++42 3 18

inkinworld

inkinworld's Stars

All-Hands-AI/OpenHands

vllm-project/vllm

iovisor/bcc

samber/lo

karpathy/minbpe

huggingface/text-generation-inference

THUDM/CodeGeeX2

harvardnlp/annotated-transformer

sgl-project/sglang

kf-liu/The-Art-of-Linear-Algebra-zh-CN

InternLM/lmdeploy

bbycroft/llm-viz

Tencent/cherry-markdown

huggingface/text-embeddings-inference

DefTruth/Awesome-LLM-Inference

mengjian-github/copilot-analysis

ZiyaoGeng/RecLearn

BBuf/how-to-optim-algorithm-in-cuda

HeKun-NVIDIA/CUDA-Programming-Guide-in-Chinese

mingrammer/flog

triton-inference-server/pytriton

IST-DASLab/marlin

zhaozhiyong19890102/Recommender-System

zc911/MatrixSlow

hellotransformers/Natural_Language_Processing_with_Transformers

charlotteLive/pybind11-Chinese-docs

jy-yuan/KIVI

Nanbeige/Nanbeige

KyleBing/map

run-ai/llmperf