Welcome to the Generative-AI-Models repository! This repository contains a collection of generative AI models implemented using Python and popular libraries like transformers
, torch
, diffusers
, and more. These models can be used for various generative tasks such as image captioning, text-to-image generation, and more.
To get started with the models in this repository, you can use Google Colab, which provides a free and powerful environment for running your code with GPU acceleration.
Before using the models, make sure you have the following:
- A Google account to access Google Colab.
- Basic knowledge of Python and deep learning concepts.
We have provided Google Colab templates for each model to ensure that you can run them quickly and efficiently. Follow the steps below to get started:
-
Open Google Colab: Click the links provided below for each model to open the respective Colab notebook.
-
Connect to a GPU: In Colab, go to
Runtime
>Change runtime type
, and selectGPU
as the hardware accelerator. -
Run the Notebook: Follow the instructions within the notebook to run the cells step by step. The models are pre-configured to run efficiently on Colab's environment.
Below is a list of available models in this repository along with their corresponding Google Colab templates:
Generate images from textual descriptions using the Stable Diffusion model.
- Model Overview: Uses a diffusion model to create high-quality images from text prompts.
- Colab Template: text_to_image_generator
- Image Generation Output:
Generate descriptive captions for images using the Vision Transformer (ViT) and GPT-2 models.
- Model Overview: Combines ViT for image processing and GPT-2 for text generation.
- Colab Template: image_to_text_generator
- Text Generation Output:
We welcome contributions from the community! If you have a model implementation you'd like to add or improvements to suggest, please follow these steps:
- Fork the repository.
- Create a new branch:
git checkout -b feature/YourFeatureName
. - Make your changes and commit them:
git commit -m 'Add some feature'
. - Push to the branch:
git push origin feature/YourFeatureName
. - Submit a pull request.
This project is licensed under the MIT License.
- Hugging Face for providing the
transformers
anddiffusers
libraries. - Google Colab for offering a free and powerful platform for running deep learning models.