This project combines the power of AI models from Hugging Face, LangChain, and OpenAI to create a seamless cooking assistant. With this tool, you can input an image of a dish, generate a detailed recipe, and listen to audio instructions for an enhanced cooking experience.
-
Image-to-Text Conversion: Uses Hugging Face's blip-image-captioning-base model to describe the dish from an image.
-
Recipe Generation: Utilizes OpenAI's GPT-3.5 to generate a list of ingredients and step-by-step cooking instructions based on the image caption.
-
Text-to-Speech Conversion: Converts the generated recipe into audio using gtts for easy listening while cooking.