text-generation-webui-barktts-plus-rvc (WIP)

A simple extension for oobabooga's text-generation-webui that uses Bark and the Retrieval-based-Voice-Conversion (RVC) project for audio output.

How to install

Assuming you already have the webui set up:

Activate the conda environment, either with the cmd_xxx.bat script or by running conda activate textgen.
Enter the text-generation-webui/extensions/ directory and clone this repository:

cd text-generation-webui/extensions/
git clone --recurse-submodules https://github.com/Sergey004/text-generation-webui-barktts-plus-rvc bark_rvc_tts/

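Install the requirements (run this from the text-generation-webui root directory, since the path starts at extensions/):

pip install -r extensions/bark_rvc_tts/requirements.txt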
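Before installing, it can help to confirm that the clone landed in extensions/bark_rvc_tts, because a wrong directory name makes the pip command fail. The snippet below is a minimal sketch of such a check; check_extension is a hypothetical helper, not part of this extension or of the webui, and it takes the webui root directory as its only argument.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with the extension): verify that the
# extension directory and its requirements.txt exist under the webui root.
check_extension() {
    root=${1:-.}                              # webui root, defaults to the current directory
    ext_dir="$root/extensions/bark_rvc_tts"   # target directory used in the clone command
    if [ ! -d "$ext_dir" ]; then
        echo "missing directory: $ext_dir" >&2
        return 1
    fi
    if [ ! -f "$ext_dir/requirements.txt" ]; then
        echo "missing file: $ext_dir/requirements.txt" >&2
        return 1
    fi
    echo "ok: $ext_dir"
}
```

Calling check_extension . from the text-generation-webui root should report the extension path when the clone step succeeded.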

Add --extensions bark_rvc_tts to your startup script, or enable the extension through the Interface Mode tab in the webui.

Tips

The full version of Bark requires around 12 GB of memory to hold everything on the GPU at the same time. Smaller cards, down to about 2 GB, also work with some additional settings. For this extension, open extensions/bark_rvc_tts/.env and set USE_SMALL_MODELS and USE_CPU to true:

# Whether to use small models
USE_SMALL_MODELS=true

# Whether to use CPU
USE_CPU=true
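
If you would rather change these flags from the command line than edit the file by hand, something like the following could work. It is a sketch only: set_env_flag is a hypothetical helper, not something the extension provides, and it assumes the .env file uses plain KEY=value lines as shown above. It rewrites an existing assignment in place, so the comments stay next to their settings, and appends the key if it is missing.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with the extension): set KEY=VALUE in a
# .env file, replacing an existing assignment in place or appending a new one.
set_env_flag() {
    file=$1
    key=$2
    value=$3
    [ -f "$file" ] || : > "$file"     # create an empty file if none exists yet
    tmp=$(mktemp) || return 1
    awk -v key="$key" -v value="$value" '
        BEGIN { prefix = key "="; found = 0 }
        # A line that starts with KEY= is replaced with the new assignment.
        index($0, prefix) == 1 { print prefix value; found = 1; next }
        { print }
        # If the key never appeared, append it at the end.
        END { if (!found) print prefix value }
    ' "$file" > "$tmp" || { rm -f "$tmp"; return 1; }
    cat "$tmp" > "$file"              # overwrite contents, keep the original file permissions
    rm -f "$tmp"
}

# Example usage, run from the text-generation-webui root:
#   set_env_flag extensions/bark_rvc_tts/.env USE_SMALL_MODELS true
#   set_env_flag extensions/bark_rvc_tts/.env USE_CPU true
```

Restart the webui after changing the file so the extension picks up the new values.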