sankeerth95's Stars
StanfordAHA/aha
IST-DASLab/marlin
FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
microsoft/VPTQ
VPTQ, A Flexible and Extreme low-bit quantization algorithm
NVIDIA/NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
wangsiping97/FastGEMV
High-speed GEMV kernels, with up to 2.7x speedup compared to the PyTorch baseline.
HanGuo97/flute
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
microsoft/microxcaling
PyTorch emulation library for Microscaling (MX)-compatible data formats
Cornell-RelaxML/qtip
huggingface/optimum-quanto
A PyTorch quantization backend for optimum
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Godofnothing/aqlm-evaluation-harness
lm-evaluation-harness version with support of AQLM intermediate checkpoints
lucidrains/vector-quantize-pytorch
Vector (and Scalar) Quantization, in PyTorch
HuangOwen/Awesome-LLM-Compression
Awesome LLM compression research papers and tools.
Vahe1994/AQLM
Official PyTorch repository for Extreme Compression of Large Language Models via Additive Quantization (https://arxiv.org/pdf/2401.06118.pdf) and PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression (https://arxiv.org/abs/2405.14852)
Cornell-RelaxML/quip-sharp
facebookresearch/MobileLLM
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
threestudio-project/threestudio
A unified framework for 3D content generation.
hustvl/GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)
gem5-graphics/gem5-graphics
gem5 simulator with a gpgpu+graphics GPU model
Florian-Barthel/splatviz
Fully Python-based interactive 3D Gaussian Splatting viewer for real-time editing and analysis.
horizon-research/Fov-3DGS
Official Implementation of RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering.
maxvfischer/DIY-CNC-machine
How to build your own CNC machine from scratch
ocornut/imgui
Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies
unixpickle/learn-ptx
Learning about CUDA by writing PTX code.
limacv/GaussianSplattingViewer
Tiny Gaussian Splatting Viewer
dendenxu/fast-gaussian-rasterization
A geometry-shader-based, globally CUDA-sorted, high-performance 3D Gaussian Splatting rasterizer. Can achieve a 5-10x speedup in rendering compared to the vanilla diff-gaussian-rasterization.
dendenxu/diff-gaussian-rasterization
Improved 3DGS rasterizer.
EricLee0224/awesome-nerf-editing
🧙🏻‍♂️ A list of papers curated for you to dive into the Awesome Radiance Field-based 3D Editing.