synxlin's Stars
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
mit-han-lab/qserve
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving
mit-han-lab/lmquant
spcl/QuaRot
Code for QuaRot, an end-to-end 4-bit inference of large language models.
mit-han-lab/streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
pest-parser/pest
The Elegant Parser
taocpp/PEGTL
Parsing Expression Grammar Template Library
UCLA-VAST/AutoSA
AutoSA: Polyhedral-Based Systolic Array Compiler
madhat2r/plaid2text
Python Scripts to export Plaid transactions and transform them into Ledger or Beancount syntax formatted files.
ggerganov/llama.cpp
LLM inference in C/C++
jbms/beancount-import
Web UI for semi-automatically importing external data into beancount
davidepatti/noxim
Network on Chip Simulator
flexflow/FlexFlow
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
cryptomator/cryptomator
Cryptomator for Windows, macOS, and Linux: Secure client-side encryption for your cloud storage, ensuring privacy and control over your data.
NVlabs/timeloop
Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.
mit-han-lab/bevfusion
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
traveller59/spconv
Spatial Sparse Convolution Library
FindDefinition/cumm
CUda Matrix Multiply library.
FindDefinition/PCCM
Python C++ Code Manager
nineisprime/optimal-branch
A light-weight implementation of Edmond's Algorithm in C++
hpcaitech/FastFold
Optimizing AlphaFold Training and Inference on GPU Clusters
aqlaboratory/openfold
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2
ucb-bar/cosa
A scheduler for spatial DNN accelerators that generate high-performance schedules in one shot using mixed integer programming (MIP)
VITA-Group/TENAS
[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang
GATECH-EIC/HW-NAS-Bench
[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark
maestro-project/confuciux
maestro-project/gamma
pku-liang/TENET
An analytical framework that models hardware dataflow of tensor applications on spatial architectures using the relation-centric notation.
ipcjs/bilibili-helper
各种油猴脚本
mit-han-lab/torchsparse
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.