bil-ash's Stars
spcl/QuaRot
Code for QuaRot, an end-to-end 4-bit inference of large language models.
rupeshs/fastsdcpu
Fast stable diffusion on CPU
microsoft/T-MAC
Low-bit LLM inference on CPU with lookup table
Cipherxzc/llama.cpp-bitnet
rahular/varta
nalgeon/redka
Redis re-implemented with SQLite
homebrewltd/llama3-s
Llama3.1 learns to Listen
GATECH-EIC/ShiftAddLLM
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
fluencelabs/redis
Redis is an in-memory database that persists on disk. The data model is key-value, but many different kind of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, HyperLogLogs, Bitmaps.
Pints-AI/1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
zhangpiu/llm.cpp
LLM training in simple, C++/CUDA(with Eigen3)
city96/ComfyUI-GGUF
GGUF Quantization support for native ComfyUI models
electric-sql/pglite
Lightweight WASM Postgres with real-time, reactive bindings.
segurac/force-host-alloction-APU
arlo-phoenix/CTranslate2-rocm
Fast inference engine for Transformer models
princeton-nlp/MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
stevefan1999-personal/nodebb-db-backend-typeorm
lu-wo/whisbert
babyLM WhisBERT code
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
openai/image-gpt
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
google-research/byt5
jprivera44/Mt5-fine-tuning-with-GPU-analysis
Fine-Tuning of a multi-language transformer model on Nvidia GPUs.
daixiangzi/VAR-CLIP
Implements VAR+CLIP for image generation
google-research/longt5
notnotrishi/chromenano
Run Gemini Nano locally on chrome
Laz4rz/GPT-2
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
Srijith-rkr/Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
TinyLLaVA/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation