JinuJeong's Stars
microsoft/BitNet
Official inference framework for 1-bit LLMs
VIA-Research/uPIMulator
abdullahfsm/PCS
sarchlab/mgpusim
A highly-flexible GPU simulator for AMD GPUs.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Feh/nocache
minimize caching effects
project-baize/baize-chatbot
Let ChatGPT teach your own chatbot in hours with a single GPU!
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
triton-lang/triton
Development repository for the Triton language and compiler
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
chrischoy/MakePytorchPlusPlus
How and why you want to make your pytorch CUDA/CPP extension with a Makefile
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Azrael3000/tmpi
Run a parallel command inside a split tmux window
NVIDIA/FasterTransformer
Transformer-related optimization, including BERT and GPT
Raphael-Hao/Abacus
Sys-KU/AutoTiering
Exploring the Design Space of Page Management for Multi-Tiered Memory Systems (USENIX ATC '21)
Sys-KU/DeepPlan
Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)
casys-kaist/HUVM
casys-kaist/CoVA
Official code repository for "CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics [USENIX ATC 22]"
neomorphism/neomo
Neomorphism (neumorphism) design framework, open source
neoclide/coc.nvim
Nodejs extension host for vim & neovim, load extensions like VSCode and host language servers.
khakiee/comments_collector
Collect Naver entertainment news comments