Audio-File-Generation-with-Gradio

An interface that lets users create sounds from text prompts.

Primary Language: Jupyter Notebook

🎵 Project Description

The goal of this project is to build an interface with Gradio and the AudioLDM model hosted on Hugging Face that lets users generate a .wav audio file from a text prompt and download it.

🛠️ Steps

  1. Create a process to generate the content
  2. Convert the generated content to a known format, in this case, .wav
  3. Build the interface that receives the prompt and the file name
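
The three steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code. The checkpoint name `cvssp/audioldm-s-full-v2`, the helper names `wav_path`, `load_pipeline`, `generate_audio`, and `build_interface`, and the generation settings (10 inference steps, 5 seconds of audio) are all assumptions chosen for the example. The heavy libraries are imported inside the functions so the module itself loads without them.

```python
from functools import lru_cache
from pathlib import Path

MODEL_ID = "cvssp/audioldm-s-full-v2"  # assumption: the README does not name a checkpoint
SAMPLE_RATE = 16_000  # AudioLDM produces 16 kHz audio


def wav_path(file_name: str) -> str:
    """Hypothetical helper: turn the user-supplied name into a .wav file name."""
    # Drop any directory part and extension; fall back to "output" if empty.
    stem = Path(file_name.strip() or "output").stem
    return f"{stem}.wav"


@lru_cache(maxsize=1)
def load_pipeline():
    """Load the AudioLDM pipeline once and reuse it across requests."""
    import torch
    from diffusers import AudioLDMPipeline

    device = "cuda" if torch.cuda.is_available() else "cpu"
    dtype = torch.float16 if device == "cuda" else torch.float32
    return AudioLDMPipeline.from_pretrained(MODEL_ID, torch_dtype=dtype).to(device)


def generate_audio(prompt: str, file_name: str) -> str:
    """Steps 1 and 2: generate audio from the prompt and save it as .wav."""
    from scipy.io import wavfile

    # Step 1: run the diffusion pipeline; .audios[0] is a 1-D NumPy waveform.
    pipe = load_pipeline()
    audio = pipe(prompt, num_inference_steps=10, audio_length_in_s=5.0).audios[0]

    # Step 2: write the waveform to disk in a known format (.wav).
    out = wav_path(file_name)
    wavfile.write(out, rate=SAMPLE_RATE, data=audio)
    return out


def build_interface():
    """Step 3: a Gradio interface that takes the prompt and the file name."""
    import gradio as gr

    return gr.Interface(
        fn=generate_audio,
        inputs=[gr.Textbox(label="Prompt"), gr.Textbox(label="File name")],
        outputs=gr.Audio(label="Generated audio", type="filepath"),
        title="Audio File Generation with Gradio",
    )


if __name__ == "__main__":
    build_interface().launch()
```

Returning a file path to a `gr.Audio` output lets the user both play the result and download it from the player. A `gr.File` output would be a reasonable alternative if only the download is needed.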

📋 Requirements

The entire process was developed in Google Colab and then moved to Hugging Face Spaces for deployment. The programming language used is Python.

Packages

Install the following Python packages before running the code:

torch
transformers
diffusers
gradio
scipy
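
As a quick sanity check before running the notebook, a small script along these lines can report which of the packages above are not yet installed. It is only a sketch: the `missing_packages` helper is a hypothetical name introduced for this example, and it checks that each package can be found, not that its version is compatible.

```python
from importlib.util import find_spec

# Package names as listed in this README.
REQUIRED = ["torch", "transformers", "diffusers", "gradio", "scipy"]


def missing_packages(names=REQUIRED):
    """Hypothetical helper: return the names that cannot be found for import."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages; install with: pip install " + " ".join(missing))
    else:
        print("All required packages are installed.")
```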

👥 Author

Deployment on Hugging Face

Cllspy - GitHub