clipit
Yet Another VQGAN-CLIP Codebase
This started as a fork of @nerdyrodent's VQGAN-CLIP code which was based on the notebooks of @RiversWithWings and @advadnoun. But it quickly morphed into a version of the code that had been tuned up with slightly different behavior and features. It also runs either at the command line or in a notebook or (soon) in batch mode.
Basically this is a verison of the notebook with opinionated defaults and slighly different internals. You are welcome to use it if you'd like.
For now, checkout THE DEMO NOTEBOOKS - especially the super simple "Start Here" colab.
Citations
@misc{unpublished2021clip,
title = {CLIP: Connecting Text and Images},
author = {Alec Radford, Ilya Sutskever, Jong Wook Kim, Gretchen Krueger, Sandhini Agarwal},
year = {2021}
}
@misc{esser2020taming,
title={Taming Transformers for High-Resolution Image Synthesis},
author={Patrick Esser and Robin Rombach and Björn Ommer},
year={2020},
eprint={2012.09841},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
Katherine Crowson - https://github.com/crowsonkb Adverb https://twitter.com/advadnoun