taming-transformers
There are 6 repositories under taming-transformers topic.
joanrod/ocr-vqgan
OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in Paper2Fig100k dataset. Implementation of OCR Perceptual loss for clear text-within-image generation. Fork from VQGAN in CompVis/taming-transformers
rosinality/taming-transformers-pytorch
Implementation of Taming Transformers for High-Resolution Image Synthesis (https://arxiv.org/abs/2012.09841) in PyTorch
sbmagar13/VQGAN-CLIP-Text-to-Image
Text-to-Image Synthesis using Multimodal (VQGAN + CLIP) Architectures
krishnakaushik25/VQGAN-CLIP
Gradio Web app for running VQGAN-CLIP locally
mehdidc/vqgan_nodep
VQGAN from LDM without hell of dependencies
eduardotakemura/text-to-image-generator
Text-to-Image Multimodal Generator which generate images from text-prompts inputs