Exploratory data analysis of a dataset of 91,000 recipes scraped from allrecipes.com
ingredient-phrase-tagger was used to parse ingredients into units, quantities, and names.
Unpack the compressed JSON file:
tar -xvf enriched_recipes.tar
enriched_recipes.json
contains recipe data that has been enriched by
ingredient-phrase-parser
ABBREV.csv
contains data from the SR28 dataset published by the USDA,
which contains the weights/densities of many ingredients.