This repository is part of my participation in Hugging Face Fine Tuning week of XLRS Wav2Vec2 on Common Voice Corpus 4 Arabic dataset.
The mini_arabic.ipynb notebook contains all data preprocessing and training steps.
The evaluation.ipynb notebook contains testing steps.
https://huggingface.co/anas/wav2vec2-large-xlsr-arabic
https://commonvoice.mozilla.org/en/datasets
https://github.com/saobou/arabic-text-preprocessing/blob/master/Preprocess.ipynb