heekyungyoon's Stars
mit-han-lab/smoothquant
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
IST-DASLab/gptq
Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
karpathy/llm.c
LLM training in simple, raw C/CUDA
apache/tvm
Open deep learning compiler stack for CPU, GPU and specialized accelerators
cuda-mode/lectures
Material for cuda-mode lectures
warpdotdev/Warp
Warp is a modern, Rust-based terminal with AI built in so you and your team can build great software, faster.
andrewekhalel/MLQuestions
Machine Learning and Computer Vision Engineer - Technical Interview Questions
onnx/onnx
Open standard for machine learning interoperability
llvm/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
ChaoningZhang/MobileSAM
This is the official code for the MobileSAM project, which makes SAM lightweight for mobile applications and beyond!
lutzroeder/netron
Visualizer for neural network, deep learning and machine learning models
milvus-io/milvus
A cloud-native vector database, storage for next generation AI applications
huggingface/candle
Minimalist ML framework for Rust
NVIDIA-Merlin/Transformers4Rec
Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.
erikbern/ann-benchmarks
Benchmarks of approximate nearest neighbor libraries in Python
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
apple/ml-fastvit
This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023
JingyunLiang/SwinIR
SwinIR: Image Restoration Using Swin Transformer (official repository)
mv-lab/swin2sr
[ECCV] Swin2SR: SwinV2 Transformer for Compressed Image Super-Resolution and Restoration. Advances in Image Manipulation (AIM) workshop, ECCV 2022. Try it out (over 3.3M runs): https://replicate.com/mv-lab/swin2sr
facebookresearch/d2go
D2Go is a toolkit for efficient deep learning
facebookresearch/segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
bentoml/BentoML
The easiest way to serve AI apps and models - Build reliable Inference APIs, LLM apps, Multi-model chains, RAG service, and much more!
titsitits/open-image-restoration
Open-Image-Restoration Toolkit: A selection of State-of-the-art, Open-source, Usable, and Pythonic techniques for Image Restoration
NVlabs/stylegan2
StyleGAN2 - Official TensorFlow Implementation
xinntao/Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
triton-lang/triton
Development repository for the Triton language and compiler