balusch

fix me.

@Adriatic-Sea Shenzhen China

balusch's Stars

ggerganov/llama.cpp
LLM inference in C/C++
Language:C++70.2k 559 4.2k10.1k
meta-llama/llama
Inference code for Llama models
Language:Python57.1k 525 1.1k9.6k
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python37.4k 351 1.8k4.6k
rasbt/LLMs-from-scratch
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Language:Jupyter Notebook37.2k 400 1114.8k
aria2/aria2
aria2 is a lightweight multi-protocol & multi-source, cross platform download utility operated in command-line. It supports HTTP/HTTPS, FTP, SFTP, BitTorrent and Metalink.
Language:C++36.4k 742 1.9k3.6k
ray-project/ray
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Language:Python34.7k 479 19.1k5.9k
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python33.1k 271 5.8k5k
SerenityOS/serenity
The Serenity Operating System 🐞
Language:C++30.9k 351 4.2k3.2k
rockerBOO/awesome-neovim
Collections of awesome neovim plugins.
16.6k 157 121750
catppuccin/catppuccin
😸 Soothing pastel theme for the high-spirited!
Language:TypeScript15.6k 49 470280
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python14.9k 123 1.2k1.4k
rigtorp/awesome-modern-cpp
A collection of resources on modern C++
Language:HTML12k 494 201.2k
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
Language:Jupyter Notebook10.3k 193 32968
microsoft/inshellisense
IDE style command line auto complete
Language:TypeScript9k 26 139195
cp-algorithms/cp-algorithms
Algorithm and data structure articles for https://cp-algorithms.com (based on http://e-maxx.ru)
Language:C++8k 105 3571.6k
boyter/scc
Sloc, Cloc and Code: scc is a very fast accurate code counter with complexity calculations and COCOMO estimates written in pure Go
Language:Go6.9k 39 302265
facebookresearch/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Language:Python6.7k 44 83596
folke/which-key.nvim
💥 Create key bindings that stick. WhichKey helps you remember your Neovim keymaps, by showing available keybindings in a popup as you type.
Language:Lua5.6k 11 591180
continue-revolution/sd-webui-segment-anything
Segment Anything for Stable Diffusion WebUI
Language:Python3.4k 33 165206
DefTruth/Awesome-LLM-Inference
📖A curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. 🎉🎉
3.1k 105 6211
kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
Language:C++2.3k 27 36131
NVIDIA/TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.
Language:Python2.1k 33 369338
llvm/torch-mlir
The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.
Language:C++1.4k 248 721517
hao-ai-lab/LookaheadDecoding
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
Language:Python1.2k 11 5870
skywind3000/emake
The simplest GCC/CLANG project build tool you have ever seen: declarative builds, simpler than imperative ones
Language:Python833 46 16115
dwmkerr/effective-shell
Text, samples and website for my 'Effective Shell' series.
Language:JavaScript712 16 4781
gpu-mode/awesomeMLSys
An ML Systems Onboarding list
598 15 019
bloomberg/quantum
Powerful multi-threaded coroutine dispatcher and parallel execution engine
Language:C++581 28 1395
rmarx/holblocking-blogpost
Blogpost on Head-of-Line blocking from HTTP/1 to HTTP/3
128 4 411
edwardqin-creator/StableDiffusion-Model-Evaluation-Framework
This is a framework to evaluate your stable diffusion model
Language:Python3 2 00
