vqvae

There are 40 repositories under vqvae topic.

fishaudio/fish-speech
SOTA Open Source TTS
Language:Python24k 136 6302k
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
Language:Python7.4k 40 911.2k
v-iashin/SpecVQGAN
Source code for "Taming Visually Guided Sound Generation" (Oral at the BMVC 2021)
Language:Jupyter Notebook367 8 3539
FoundationVision/OmniTokenizer
[NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.
Language:Python316 5 218
k2kobayashi/crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
Language:Python171 9 2831
ZhengdiYu/SignAvatars
(ECCV 2024) SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark
Language:Python134 9 1715
haoliuhl/language-quantized-autoencoders
Language Quantized AutoEncoders
Language:Python110 1 45
Vermeille/Torchelie
Torchélie is a set of utility functions, layers, losses, models, trainers and other things for PyTorch.
Language:Python110 7 2811
mahmoodlab/SISH
Fast and scalable search of whole-slide images via self-supervised deep learning - Nature Biomedical Engineering
Language:Python108 2 727
Neur-IO/OptVQ
Towards training VQ-VAE models robustly!
Language:Python85 1 102
hqyyqh888/RobustSemanComm
Demo of robust semantic communication against semantic noise
Language:Python82 1 619
explainingai-code/VQVAE-Pytorch
This repo implements VQVAE on mnist and as well as colored version of mnist images. It also implements simple LSTM for generating sample numbers using the encoder outputs of trained VQVAE
Language:Python59 1 09
vsimkus/vae-voice-conversion
Voice conversion (VC) investigation using three variants of VAE
Language:Python58 4 311
FoundationVision/BitVAE
official training and inference code of bitwise tokenizer
Language:Python51 2 32
SerezD/vqvae-vqgan-pytorch-lightning
VQ-VAE/GAN implementation in pytorch-lightning
Language:Python50 2 14
affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition
Inverse DALL-E for Optical Character Recognition
Language:Python38 1 26
amzn/sparse-vqvae
Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper
Language:Python34 5 014
MIMICLab/BITTERS
Large-Scale Bidirectional Training for Zero-Shot Image Captioning
Language:Python21 1 13
BhanuPrakashPebbeti/Image-Generation-Using-VQVAE
Image Generation using VQVAE and GPT Models
Language:Jupyter Notebook19 1 12
jaywalnut310/Vector-Quantized-Autoencoders
Tensorflow Implementation of "Theory and Experiments on Vector Quantized Autoencoders"
Language:Python14 3 03
sayedmohamedscu/VQGAN
Vector-Quantized Generative Adversarial Networks
Language:Python10 1 01
xnought/vq-vae-explainer
Interactive VQ-VAE (Vector-Quantized Variational Autoencoder) in the browser
Language:Jupyter Notebook6 1 10
lupalab/posterior-matching
Official code for the NeurIPS 2022 paper "Posterior Matching for Arbitrary Conditioning".
Language:Python4 1 10
mehdidc/vqgan_nodep
VQGAN from LDM without hell of dependencies
Language:Python4 1 0
aillaud/VQVAE_Flax
Implementation of basic autoencodeur, VAE and VQVAE in Flax
Language:Jupyter Notebook3 1 11
fostiropoulos/dvq
Applying multiple VQ along the feature axis
Language:Jupyter Notebook2 2 02
rogertrullo/VQVAE_Pytorch
implementation of VQVAE in pytorch
Language:Jupyter Notebook2 2 01
SnowYJ/T5VQVAE
Improving Semantic Control in Discrete Latent Spaces with Transformer Quantized Variational Autoencoders
Language:Python2 1 10
aillaud/Diffusion-models
State of the art of generative models and in-depth study of diffusion models
Language:Jupyter Notebook1 2 00
DYZhang09/VQ-VAE
naive pytorch implementation of VQ-VAE
Language:Python1 1 01
maj34/facegram
[GDSC Solution Challenge] Facegram All-Part Merged Repository
Language:Jupyter Notebook1 0 01
MissMeriel/PreFixer
Learning universal transformations between perception datasets to overcome sensor hardware versioning
Language:Python1 1 00
viviaxenov/text_to_image_with_transformer
An educational project dedicated to text-to-image generation with neural networks. VQVAE and BPE autoencoders are used to learn the embedding of text and image respectively. A transformer-based model then is trained to predict the next token in the concatenated sequence of image and text tokens and used for generation.
Language:Python1 0 00
jkyl/vq-vae
A JAX / NNX implementation of a VQ-VAE for audio compression
Language:Jupyter Notebook0 1 00
unshun0120/Apply-FederatedLearning-into-Autoencoder
Using Federated Learning to train Autoencoder and its variants' models in pytorch
Language:Python0 1 00
UbitonAI/experiments
Experiments
Language:Jupyter Notebook

vqvae

fishaudio/fish-speech

AntixK/PyTorch-VAE

v-iashin/SpecVQGAN

FoundationVision/OmniTokenizer

k2kobayashi/crank

ZhengdiYu/SignAvatars

haoliuhl/language-quantized-autoencoders

Vermeille/Torchelie

mahmoodlab/SISH

Neur-IO/OptVQ

hqyyqh888/RobustSemanComm

explainingai-code/VQVAE-Pytorch

vsimkus/vae-voice-conversion

FoundationVision/BitVAE

SerezD/vqvae-vqgan-pytorch-lightning

affjljoo3581/Inverse-DALL-E-for-Optical-Character-Recognition

amzn/sparse-vqvae

MIMICLab/BITTERS

BhanuPrakashPebbeti/Image-Generation-Using-VQVAE

jaywalnut310/Vector-Quantized-Autoencoders

sayedmohamedscu/VQGAN

xnought/vq-vae-explainer

lupalab/posterior-matching

mehdidc/vqgan_nodep

aillaud/VQVAE_Flax

fostiropoulos/dvq

rogertrullo/VQVAE_Pytorch

SnowYJ/T5VQVAE

aillaud/Diffusion-models

DYZhang09/VQ-VAE

maj34/facegram

MissMeriel/PreFixer

viviaxenov/text_to_image_with_transformer

jkyl/vq-vae

unshun0120/Apply-FederatedLearning-into-Autoencoder

UbitonAI/experiments