Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC
- A Windows/Linux system with a minimum of
16GB
RAM. - A GPU with at least
12GB
of VRAM. - Python >= 3.8
- Anaconda installed.
- Pytorch installed.
- CUDA 11.7 installed.
Pytorch install command:
pip install torch==2.0.0+cu117 torchvision==0.15.1+cu117 torchaudio==2.0.1 --index-url https://download.pytorch.org/whl/cu117
CUDA 11.7 install:
https://developer.nvidia.com/cuda-11-7-0-download-archive
- Create an Anaconda environment:
conda create -n barkrvc python=3.9
- Activate the environment:
conda activate barkrvc
- Clone this repository to your local machine:
git clone https://github.com/ORI-Muchim/BARK-RVC.git
- Navigate to the cloned directory:
cd BARK-RVC
- Install the necessary dependencies:
pip install -r requirements.txt
If you open ./main.py
, There is sample text. There are many text prompts such as [laughs]
in the bark.
[laughter]
[laughs]
[sighs]
[music]
[gasps]
[clears throat]
—
or...
for hesitations♪
for song lyrics- CAPITALIZATION for emphasis of a word
[MAN]
and[WOMAN]
to bias Bark toward male and female speakers, respectively
Language | Status |
---|---|
English (en) | ✅ |
German (de) | ✅ |
Spanish (es) | ✅ |
French (fr) | ✅ |
Hindi (hi) | ✅ |
Italian (it) | ✅ |
Japanese (ja) | ✅ |
Korean (ko) | ✅ |
Polish (pl) | ✅ |
Portuguese (pt) | ✅ |
Russian (ru) | ✅ |
Turkish (tr) | ✅ |
Chinese, simplified (zh) | ✅ |
Voice Presets can be found here:
https://suno-ai.notion.site/8b8e8749ed514b0cbf3f699013548683?v=bc67cff786b04b50b3ceb756fd05f68c
Place the audio files ./datasets/{speaker_name}
.
.mp3 or .wav files are okay.
And, use the following command:
python main.py {speaker_name}
- Audio-Upsampling Using NU-Wave2 ✅
For more infomation, Please refer to the following repositories: