berlino's Stars
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
lllyasviel/ControlNet
Let us control diffusion models!
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
openai/shap-e
Generate 3D objects conditioned on text or images
mistralai/mistral-inference
Official inference library for Mistral models
BartoszJarocki/cv
Print-friendly, minimalist CV page
pytorch-labs/gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
OpenBMB/MiniCPM
MiniCPM-2B: An end-side LLM outperforming Llama2-13B.
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
google/tangent
Source-to-Source Debuggable Derivatives in Pure Python
jbmouret/matplotlib_for_papers
Handout for the tutorial "Creating publication-quality figures with matplotlib"
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
apple/axlearn
An Extensible Deep Learning Library
ChenHsing/Awesome-Video-Diffusion-Models
[Arxiv] A Survey on Video Diffusion Models
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
databricks/megablocks
hao-ai-lab/LookaheadDecoding
srush/Triton-Puzzles
Puzzles for learning Triton
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
r2d4/rellm
Exact structure out of any language model completion.
universome/stylegan-v
[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
sihyun-yu/PVDM
Official PyTorch implementation of Video Probabilistic Diffusion Models in Projected Latent Space (CVPR 2023).
HazyResearch/zoology
Understand and test language model architectures on synthetic tasks.
shawntan/scattermoe
Triton-based implementation of Sparse Mixture of Experts.
proger/accelerated-scan
Accelerated First Order Parallel Associative Scan
clinicalml/co-llm
codekansas/rwkv
RWKV model implementation
parkervg/blendsql
Query language for blending SQL logic and LLM reasoning across multi-modal data. [Findings of ACL 2024]
subho406/Recurrent-Linear-Transformers
Implementation of Recurrent Linear Transformers in Jax+Flax.