This project uses deep learning to automatically generate descriptive captions for images. It also includes a self-hosted Flask web server where users can upload their own images.
- Automatically generates captions for uploaded images.
- Uses a pretrained deep learning model to understand image content.
- Combines computer vision and natural language processing in a single CNN-Transformer model.
- Clone this repository to your local machine:
git clone https://github.com/Asatheesh6561/End-To-End-Image-Captioning.git
- Navigate to the project directory:
cd End-To-End-Image-Captioning
- Install the required dependencies:
pip install -r requirements.txt
- Train the CNN-Transformer model by running this command:
python main.py
- Start the Flask prediction server on your local machine with this command:
python app.py
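
For readers unfamiliar with Flask uploads, here is a minimal sketch of what an image upload endpoint for captioning might look like. It is not the project's actual `app.py`, which may be organized differently. The `/caption` route, the `image` form field, and the `generate_caption` function are all hypothetical names chosen for illustration, and the caption function is a stub rather than the trained model.

```python
import io

from flask import Flask, jsonify, request

app = Flask(__name__)


def generate_caption(image_bytes: bytes) -> str:
    """Hypothetical stand-in for the trained model's inference call."""
    # Assumption: the real project would decode the image and run the
    # CNN-Transformer model here. This stub only reports the upload size.
    return f"placeholder caption for a {len(image_bytes)}-byte image"


@app.route("/caption", methods=["POST"])  # hypothetical route name
def caption():
    # Flask exposes multipart file uploads through request.files.
    upload = request.files.get("image")  # hypothetical form field name
    if upload is None or upload.filename == "":
        return jsonify(error="no image uploaded"), 400
    return jsonify(caption=generate_caption(upload.read()))


if __name__ == "__main__":
    # Exercise the route in-process with Flask's test client,
    # so no running server is needed to try the sketch.
    client = app.test_client()
    response = client.post(
        "/caption",
        data={"image": (io.BytesIO(b"fake image bytes"), "example.jpg")},
        content_type="multipart/form-data",
    )
    print(response.get_json())
```

Running the sketch directly posts a fake file through Flask's test client and prints the JSON response. To serve real requests, the final block would call `app.run()` instead.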