/Caption2Image

Term Project for the course EEE543 Neural Networks.

Primary LanguageJupyter Notebook

Caption2Image

This Project was developed as the term project for EEE543 Neural Networks in Bilkent University Electrical and Electronics Engineering M.S. program.

NOTE: The repo itself was not built to be useful for a 3rd party. However, you are welcome to experiment with the code.

Training

The model has been trained with the COCO dataset using the ideas from GLIDE and DALL·E 2. Both of these papers employ Diffusion Models for their training, which this project does not support.

Data

You can find some useful preprocessed data in this Google Drive folder. The descriptions for each of the files can be found in the data/ folder of this repo.

Authors

Abdullah Arda Aşcı
Ali Necat Karakuloğlu
Osman Buğra Aydın