This repository contains code and resources for image caption generator using deep learning models. The project aims to develop accurate and efficient models for describing an image in words by the model.
The Image Caption Generator is a deep learning based project that automatically generates captions for images using state-of-the-art computer vision techniques and natural language processing models.
- Automatic image captioning using deep learning models.
- Pre-trained models for caption generation.
- Support for various image formats.
- Clone the repository to your local machine:
git clone https://github.com/ArnabKumarRoy02/Image-Caption-Generator.git
- Install the required dependencies:
pip install -r requirements.txt
- Make sure you have the required dependencies installed.
- Run the
app.py
script to test it:
flask run
- Optionally, you can fine-tune or train your own models using the provided dataset and scripts.
The Image Caption Generator uses the Flickr8k dataset for training. Make sure to download the dataset or use your own dataset for training.
We provide pre-trained models that can be use for generating captions without the need for training. You can download the pre-trained models from the following links:
Contributions to this project are welcome! If you find any issues or want to add new features, feel free to open an issue or submit a pull request.
This project is licensed under the MIT License
- The Image Caption Generator project is built upon the works of several open-source libraries and research papers. We acknowledge the contributions of the research community and the creators of the datasets used in this project.