wangdelp's Stars
xai-org/grok-1
Grok open release
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
thunil/TecoGAN
This repo contains source code and materials for the TEmporally COherent GAN SIGGRAPH project.
aimhubio/aim
Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.
showlab/Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
quic/aimet
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
gnobitab/InstaFlow
:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
lucidrains/muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch
real-stanford/universal_manipulation_interface
Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots
willisma/SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
dome272/Wuerstchen
Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models
mkshing/ziplora-pytorch
Implementation of "ZipLoRA: Any Subject in Any Style by Effectively Merging LoRAs"
philipturner/metal-flash-attention
FlashAttention (Metal Port)
Newbeeer/pfgmpp
Code for ICML 2023 paper, "PFGM++: Unlocking the Potential of Physics-Inspired Generative Models"
kyegomez/CM3Leon
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images
huggingface/open-muse
Open reproduction of MUSE for fast text2image generation.
locuslab/ect
Consistency Models Made Easy
mlomnitz/DiffJPEG
junhsss/consistency-models
A Toolkit for OpenAI's Consistency Models.
wenhao728/awesome-diffusion-v2v
Awesome diffusion Video-to-Video (V2V). A collection of paper on diffusion model-based video editing, aka. video-to-video (V2V) translation. And a video editing benchmark code.
AlexiaJM/AdversarialConsistentScoreMatching
Code for paper "Adversarial score matching and improved sampling for image generation"
huanngzh/EpiDiff
[CVPR 2024] EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion
apple/ml-vision-transformers-ane
apple/ml-tract