Introduce：

This is a Uyghur language translator that supports speech-to-text in Uyghur language, and text-to-speech in Uyghur language. And Chinese to Uyghur and Uyghur to Chinese. Notice that we directly use the model from MMS (Meta) and NLLB (Meta).
You can find everything from:
https://huggingface.co/facebook/mms-1b-all
https://github.com/facebookresearch/fairseq/tree/main/examples/mms#asr
https://huggingface.co/facebook/nllb-200-distilled-600M

Usage

Installation

python >= 3.8
torch >= 1.13

git clone https://github.com/facebookresearch/fairseq.git
cd fairseq
pip install --editable ./ 
pip install tensorboardX

cd ..
git clone https://github.com/jaywalnut310/vits.git
cd vits
pip install Cython==0.29.21
pip install librosa
pip install phonemizer==2.2.1
pip install scipy
pip install numpy
pip install matplotlib
pip install Unidecode==1.1.1

cd monotonic_align/
mkdir monotonic_align
python3 setup.py build_ext --inplace

Preparation

MMS checkpoint download

wget https://dl.fbaipublicfiles.com/mms/tts/uig-script_arabic.tar.gz
tar -zxvf uig-script_arabic.tar.gz
wget https://dl.fbaipublicfiles.com/mms/asr/mms1b_all.pt'

NLLB checkpoint download

# run in python
model = AutoModelForSeq2SeqLM.from_pretrained("facebook/nllb-200-distilled-600M")
tokenizer_uig = AutoTokenizer.from_pretrained("facebook/nllb-200-distilled-600M", src_lang="uig_Arab")
model.save_pretrained("model/nllb-200-distilled-600M")
tokenizer_uig.save_pretrained("model/nllb-200-distilled-600M")

Example

Example of how to use mms to do ASR and TTS:

Example code

ASR_TTS_Run.ipynb

Example of how to use nllb to do Chinese to Uyghur and Uyghur to Chinese:

API

translator.translateArab2Latn()
translator.translateArab2Hans()
translator.translateLatn2Arab()
translator.translateLatn2Hans()
translator.translateHans2Arab()
translator.translateHans2Latn()

Example code

translators.py

vvirgooo2/Uyghur-Translator

Introduce：

Usage

Installation

Preparation

Example

Example of how to use mms to do ASR and TTS:

Example code

Example of how to use nllb to do Chinese to Uyghur and Uyghur to Chinese:

API

Example code