alex4727's Stars
openai/openai-cookbook
Examples and guides for using the OpenAI API
danielgatis/rembg
Rembg is a tool to remove images background
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
kornia/kornia
Geometric Computer Vision Library for Spatial AI
modelscope/DiffSynth-Studio
Enjoy the magic of Diffusion models!
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
TencentARC/InstantMesh
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
LLaVA-VL/LLaVA-NeXT
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
apple/ml-4m
4M: Massively Multimodal Masked Modeling
FoundationVision/LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
tianweiy/DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
magic-research/piecewise-rectified-flow
PeRFlow: Piecewise Rectified Flow as Universal Plug-and-Play Accelerator (NeurIPS 2024)
fal-ai/aura-sr
AuraSR: GAN-based Super-Resolution for real-world
CompVis/zigma
A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)
Ji4chenLi/t2v-turbo
Code repository for T2V-Turbo and T2V-Turbo-v2
magicien/GLTFQuickLook
macOS QuickLook plugin for glTF files
VITA-Group/Diffusion4D
"Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, Yunchao Wei
3DTopia/GPTEval3D
[ CVPR 2024 ] Implementation for "GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation"
Taited/clip-score
Quick scripts to calculate CLIP text-image similarity
tcwang0509/TalkingHead-1KH
DQiaole/MemFlow
[CVPR 2024] MemFlow: Optical Flow Estimation and Prediction with Memory
ashawkey/objaverse_filter
naive filter of objaverse
malbergo/stochastic-interpolants
feizc/Dimba
Transformer-Mamba Diffusion Models
zewei-Zhang/GoodDrag
Ofiicial GoodDrag implementation.
Monalissaa/DisenDiff
[CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization
keturn/sd-progress-demo
cheap views of intermediate Stable Diffusion results