zeyuanyin's Stars
mit-han-lab/llm-awq
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
openai/CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
lucidrains/x-clip
A concise but complete implementation of CLIP with various experimental improvements from recent papers
Trustworthy-AI-Group/bib_parse
X-PLUG/mPLUG-Owl
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
TRI-ML/prismatic-vlms
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
mlfoundations/open_clip
An open source implementation of CLIP.
Cerebras/modelzoo
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
ModelTC/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
HazyResearch/data-centric-ai
Resources for Data Centric AI
triton-lang/triton
Development repository for the Triton language and compiler
state-spaces/s4
Structured state space sequence models
srush/annotated-mamba
Annotated version of the Mamba paper
berlino/gated_linear_attention
lucidrains/MEGABYTE-pytorch
Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in PyTorch
yyyujintang/Awesome-Mamba-Papers
Awesome Papers related to Mamba.
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
NUS-HPC-AI-Lab/DATM
ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching
lisadunlap/ALIA
Augmenting with Language-guided Image Augmentation (ALIA)
aditya-shri/VPN
Personal VPN using Shadowsocks and v2ray
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
state-spaces/mamba
Mamba SSM architecture
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
lucidrains/llama-qrlhf
Implementation of the Llama architecture with RLHF + Q-learning
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
pytorch/data
A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
rasbt/LLMs-from-scratch
Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step