bil-ash

bil-ash's Stars

spcl/QuaRot
Code for QuaRot, an end-to-end 4-bit inference of large language models.
Language:Python25720
rupeshs/fastsdcpu
Fast stable diffusion on CPU
Language:Python1.4k115
microsoft/T-MAC
Low-bit LLM inference on CPU with lookup table
Language:C++45633
Cipherxzc/llama.cpp-bitnet
Language:C++1
rahular/varta
Language:Python8
nalgeon/redka
Redis re-implemented with SQLite
Language:Go3.4k94
homebrewltd/llama3-s
Llama3.1 learns to Listen
Language:Python1485
GATECH-EIC/ShiftAddLLM
ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization
Language:Python849
fluencelabs/redis
Redis is an in-memory database that persists on disk. The data model is key-value, but many different kind of values are supported: Strings, Lists, Sets, Sorted Sets, Hashes, HyperLogLogs, Bitmaps.
Language:C302
Pints-AI/1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
Language:Python23417
zhangpiu/llm.cpp
LLM training in simple, C++/CUDA(with Eigen3)
Language:Cuda10
city96/ComfyUI-GGUF
GGUF Quantization support for native ComfyUI models
Language:Python79245
electric-sql/pglite
Lightweight WASM Postgres with real-time, reactive bindings.
Language:TypeScript8.3k163
segurac/force-host-alloction-APU
Language:Python372
arlo-phoenix/CTranslate2-rocm
Fast inference engine for Transformer models
Language:C++71
princeton-nlp/MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
Language:Python1k61
stevefan1999-personal/nodebb-db-backend-typeorm
Language:TypeScript1
lu-wo/whisbert
babyLM WhisBERT code
Language:Jupyter Notebook141
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Language:Python2.7k172
openai/image-gpt
Language:Python2k387
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Language:Python1.1k66
google-research/byt5
Language:Python48230
jprivera44/Mt5-fine-tuning-with-GPU-analysis
Fine-Tuning of a multi-language transformer model on Nvidia GPUs.
Language:Jupyter Notebook1
daixiangzi/VAR-CLIP
Implements VAR+CLIP for image generation
Language:Python672
google-research/longt5
Language:Python17818
notnotrishi/chromenano
Run Gemini Nano locally on chrome
Language:HTML23
Laz4rz/GPT-2
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
Language:Jupyter Notebook1655
Srijith-rkr/Whispering-LLaMA
EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction
Language:Jupyter Notebook22615
TinyLLaVA/TinyLLaVA_Factory
A Framework of Small-scale Large Multimodal Models
Language:Python58453
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Language:Python1.2k49