IoSonoMarco's Stars
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
neuraptic/cpe-clip
Official implementation of the "Multimodal Parameter-Efficient Few-Shot Class Incremental Learning" paper
archinetai/audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
dome272/VQGAN-pytorch
PyTorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)
OpenGVLab/VideoMAEv2
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
MCG-NJU/VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
lucidrains/transframer-pytorch
Implementation of Transframer, DeepMind's U-Net + Transformer architecture for generating videos of up to 30 seconds, in PyTorch
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
BradyFU/Awesome-Multimodal-Large-Language-Models
✨✨ Latest Advances on Multimodal Large Language Models