Pinned Repositories
nyunAI's Repositories
nyunAI/nyuntam
nyunAI/PruneGPT
nyunAI/Faster-LLM-Survey
nyunAI/SFSD-LLM
nyunAI/lmquant
nyunAI/nyuntam-text-generation
nyunAI/nyuntam-vision
nyunAI/nyuntam_adapt
nyunAI/nyunzero-cli
nyunAI/PatchGD
nyunAI/PatchGD_2.0
nyunAI/TensorRT-LLM
nyunAI/AQLM
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.pdf
nyunAI/FLAP
Patch for Grouped Query Attention
nyunAI/nyuntam-docs
This is the official documentation for nyuntam
nyunAI/qserve
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving