/DiffScaler

A Novel Approach to Video Generation model and their Archives

novel-video-synthesis

Base mode

Open Source Video Generation Models to Checkout

Miscellaneous Video Generation Models

References & citations

Spatial & Temporal Transformer

https://huggingface.co/hotshotco/Hotshot-XL https://github.com/PKU-YuanGroup/Open-Sora-Plan

ControlNet

Distributive training

https://github.com/AUTOMATIC1111/stable-diffusion-webui https://github.com/comfyanonymous/ComfyUI https://github.com/facebookresearch/xformers https://github.com/Dao-AILab/flash-attention

FiT: Flexible Vision Transformer for Diffusion Model

SOTA Caption Generation for video : https://github.com/snap-research/Panda-70M https://github.com/willisma/SiT

Unet

https://github.com/Vchitect/Latte

VAE

image

AnimateDiff + freeinit