vqvae
There are 38 repositories under the vqvae topic.
fishaudio/fish-speech
SOTA Open Source TTS
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
v-iashin/SpecVQGAN
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
FoundationVision/OmniTokenizer
[NeurIPS 2024] OmniTokenizer: one model and one weight for image-video joint tokenization.
k2kobayashi/crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
ZhengdiYu/SignAvatars
(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark
Vermeille/Torchelie
Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.
haoliuhl/language-quantized-autoencoders
Language Quantized AutoEncoders
mahmoodlab/SISH
Fast and scalable search of whole-slide images via self-supervised deep learning - Nature Biomedical Engineering
hqyyqh888/RobustSemanComm
Demo of robust semantic communication against semantic noise
zbr17/OptVQ
Towards training VQ-VAE models robustly!
explainingai-code/VQVAE-Pytorch
This repo implements VQVAE on MNIST as well as on a colored version of the MNIST images. It also implements a simple LSTM for generating sample numbers using the encoder outputs of the trained VQVAE.
vsimkus/vae-voice-conversion
Voice conversion (VC) investigation using three variants of VAE
SerezD/vqvae-vqgan-pytorch-lightning
VQ-VAE/GAN implementation in pytorch-lightning
affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition
Inverse DALL-E for Optical Character Recognition
amzn/sparse-vqvae
Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper
MIMICLab/BITTERS
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
BhanuPrakashPebbeti/Image-Generation-Using-VQVAE
Image Generation using VQVAE and GPT Models
jaywalnut310/Vector-Quantized-Autoencoders
TensorFlow implementation of "Theory and Experiments on Vector Quantized Autoencoders"
sayedmohamedscu/VQGAN
Vector-Quantized Generative Adversarial Networks
xnought/vq-vae-explainer
Interactive VQ-VAE (Vector-Quantized Variational Autoencoder) in the browser
lupalab/posterior-matching
Official code for the NeurIPS 2022 paper "Posterior Matching for Arbitrary Conditioning".
mehdidc/vqgan_nodep
VQGAN from LDM without the dependency hell
aillaud/VQVAE_Flax
Implementation of a basic autoencoder, VAE and VQVAE in Flax
fostiropoulos/dvq
Applying multiple VQ along the feature axis
maj34/facegram
[GDSC Solution Challenge] Facegram All-Part Merged Repository
rogertrullo/VQVAE_Pytorch
implementation of VQVAE in pytorch
aillaud/Diffusion-models
State of the art of generative models and in-depth study of diffusion models
DYZhang09/VQ-VAE
naive pytorch implementation of VQ-VAE
SnowYJ/T5VQVAE
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders
viviaxenov/text_to_image_with_transformer
An educational project dedicated to text-to-image generation with neural networks. VQVAE and BPE autoencoders are used to learn the embeddings of images and text, respectively. A transformer-based model is then trained to predict the next token in the concatenated sequence of image and text tokens, and is used for generation.
filipposchiazza/Transformer
Torch implementation of minGPT for image latent code generation
jkyl/vq-vae
A JAX / NNX implementation of a VQ-VAE for audio compression
unshun0120/Apply-FederatedLearning-into-Autoencoder
Using federated learning to train an autoencoder and its variants in PyTorch
MissMeriel/PreFixer
Accepted to IEEE Robotics and Automation Letters (RA-L) April 2024