paper2fig

There are 2 repositories under the paper2fig topic.

  • joanrod/ocr-vqgan

    OCR-VQGAN, a discrete image encoder (tokenizer and detokenizer) for figure images in the Paper2Fig100k dataset. Implements an OCR perceptual loss for clear text-within-image generation. Forked from VQGAN in CompVis/taming-transformers.

    Language: Python
  • joanrod/paper2figure-dataset

    Pipeline to create the Paper2Fig dataset, a dataset for text-to-image generation from research papers and figures (e.g., diagrams of architectures or methods in fields like Machine Learning or Computer Vision).

    Language: Python