painebenjamin's Stars
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
lllyasviel/IC-Light
More relighting!
DepthAnything/Depth-Anything-V2
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
alibaba/EasyNLP
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: š¦ Llama for Scalable Image Generation
ali-vilab/MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
DachunKai/EvTexture
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
ali-vilab/UniAnimate
Code for Paper "UniAnimate: Taming Unified Video Diļ¬usion Models for Consistent Human Image Animation".
lucidrains/magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
Jeff-LiangF/streamv2v
Official Pytorch implementation of StreamV2V.
fal-ai/aura-sr
AuraSR: GAN-based Super-Resolution for real-world
TheMistoAI/ComfyUI-Anyline
Anyline: A Fast, Accurate, and Detailed Line Detection Preprocessor
lllyasviel/LayerDiffuse_DiffusersCLI
LayerDiffuse in pure diffusers without any GUI
cosmicman-cvpr2024/CosmicMan
CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)
sled-group/InfEdit
[CVPR 2024] Official implementation of CVPR 2024 paper: "Inversion-Free Image Editing with Natural Language"
AILab-CVC/CV-VAE
[NeurIPS 2024] CV-VAE: A Compatible Video VAE for Latent Generative Video Models
CFGpp-diffusion/CFGpp
Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models"
czg1225/AsyncDiff
[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
Zerg-Overmind/GaussianFlow
GaussianFlow: Splatting Gaussian Dynamics for 4D Content Creation
b0nes164/GPUSorting
State of the art sorting and segmented sorting, including OneSweep. Implemented in CUDA, D3D12, and Unity style compute shaders. Theoretically portable to all wave/warp/subgroup sizes.
yashkant/spad
Code for SPAD : Spatially Aware Multiview Diffusers, CVPR 2024
Costwen/Ouroboros3D
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
ai-med/StablePose
Official Pytorch Implementation of Paper - Stable-Pose: Leveraging Transformers for Pose-Guided Text-to-Image Generation - NeurIPS 2024
Litalby1/make-it-count
Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects"
philippe-eecs/small-vision
A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.
Dawn-LX/Causal-VideoGen
PipeFusion/DistVAE
A parallelism VAE avoids OOM for high resolution image generation
camenduru/ExVideo-jupyter
painebenjamin/fruition
The Fruition framework turbocharges Python web applications with a huge array of features and easy-to-use interface.