hyc9's Stars
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
NUS-HPC-AI-Lab/OpenDiT
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
3017218062/Pytorch-Lightning-Learning
Pytorch Lightning入门中文教程,转载请注明来源。(当初是写着玩的,建议看完MNIST这个例子再上手)
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
FoundationVision/OmniTokenizer
OmniTokenizer: one model and one weight for image-video joint tokenization.
TencentARC/Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
google-research/pix2seq
Pix2Seq codebase: multi-tasks with generative modeling (autoregressive and diffusion)
NovelAI/novelai-aspect-ratio-bucketing
Implementation of aspect ratio bucketing for training generative image models as described in: https://blog.novelai.net/novelai-improvements-on-stable-diffusion-e10d38db82ac
ppingzhang/Awesome-Deep-Learning-Based-Video-Compression
Paper list: deep learning based video compression
Netflix/vmaf
Perceptual video quality assessment based on multi-method fusion.
AILab-CVC/CV-VAE
CV-VAE: A Compatible Video VAE for Latent Generative Video Models
DeepAI-Research/OpenSoraTraining
A hacker's guide to training Open Sora Plan on your custom dataset and GPUs
apple/ml-mgie
wilson1yan/teco
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
CompVis/latent-diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
qiuyu96/CoDeF
[CVPR 2024 Highlight] Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Stability-AI/generative-models
Generative Models by Stability AI
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
MichalGeyer/plug-and-play
Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
evalcrafter/EvalCrafter
[CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
amusi/CVPR2024-Papers-with-Code
CVPR 2024 论文和开源项目合集
Picsart-AI-Research/StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
daooshee/HD-VG-130M
The HD-VG-130M Dataset
OpenGVLab/InternVideo
[ECCV2024] Video Foundation Models & Data for Multimodal Understanding
CompVis/stable-diffusion
A latent text-to-image diffusion model
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
guoyww/AnimateDiff
Official implementation of AnimateDiff.