This project implements an Image Caption Generator using a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs). The goal is to generate descriptive captions for input images.
- Python 3.x
- TensorFlow
- VisionEncoderDecoderModel
- ViTFeatureExtractor
- AutoTokenizer
- Clone the repository:
git clone https://github.com/praveenliyanage/image-caption-generator-cnn.git cd image-caption-generator
- !pip install transformers