berlino's Stars
huggingface/pytorch-image-models
The largest collection of PyTorch image encoders / backbones. Includes train, eval, inference, and export scripts, plus pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
lllyasviel/ControlNet
Let us control diffusion models!
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—foundation models
alshedivat/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
BartoszJarocki/cv
Print-friendly, minimalist CV page
OpenBMB/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
OpenBMB/MiniCPM
MiniCPM-2B: An end-side LLM outperforming Llama2-13B.
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
google/tangent
Source-to-Source Debuggable Derivatives in Pure Python
jbmouret/matplotlib_for_papers
Handout for the tutorial "Creating publication-quality figures with matplotlib"
apple/axlearn
An Extensible Deep Learning Library
facebookresearch/schedule_free
Schedule-Free Optimization in PyTorch
ChenHsing/Awesome-Video-Diffusion-Models
[arXiv] A Survey on Video Diffusion Models
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
pytorch/torchtitan
A native PyTorch library for large model training
databricks/megablocks
srush/Triton-Puzzles
Puzzles for learning Triton
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
universome/stylegan-v
[CVPR 2022] StyleGAN-V: A Continuous Video Generator with the Price, Image Quality and Perks of StyleGAN2
sihyun-yu/PVDM
Official PyTorch implementation of Video Probabilistic Diffusion Models in Projected Latent Space (CVPR 2023).
HazyResearch/zoology
Understand and test language model architectures on synthetic tasks.
google-deepmind/nanodo
shawntan/scattermoe
Triton-based implementation of Sparse Mixture of Experts.
proger/accelerated-scan
Accelerated First Order Parallel Associative Scan
clinicalml/co-llm
codekansas/rwkv
RWKV model implementation
parkervg/blendsql
Query language for blending SQL logic and LLM reasoning across multi-modal data. [Findings of ACL 2024]
subho406/Recurrent-Linear-Transformers
Implementation of Recurrent Linear Transformers in JAX+Flax.