Inverse Cooking recipe Generation from food images

An auto encoder-decoder with transformer based system to predict the recipe of food from its images.

Reference Paper

Network

Requirements

numpy
scipy
matplotlib
nltk
Pillow
tqdm
lmdb
tensorflow
tensorboardX
Pytorch 0.4.1

Pre-requisites

Transformer
Encoders and Decoders
Attention networks
RNNs
LSTMs

Dataset

The Recipe1M dataset composed of 1 029 720 recipes scraped from cooking websites. The dataset contains 720 639 training, 155 036 validation and 154 045 test recipes, containing a title, a list of ingredients, a list of cooking instructions and (optionally) an image.

Optimisation

In the first stage, we pre-train the image encoder and ingredients decoder. Then, in the second stage, we train the ingredient encoder and instruction decoder by minimizing the negative log-likelihood and adjusting θR and θE.

Pre-Trained model

Find ingredient vocabulary https://dl.fbaipublicfiles.com/inversecooking/ingr_vocab.pkl
Find instruction vocabulary https://dl.fbaipublicfiles.com/inversecooking/instr_vocab.pkl
Find pre-trained model here https://dl.fbaipublicfiles.com/inversecooking/modelbest.ckpt

adasrinivas1229/Inverse_Cooking_recipe_Generation_from_food_images