With the spirit of reproducible research, this repository contains codes required to produce the results in the manuscript:
P. Berjon, A. Nag, and S. Dev, On the Analysis of French Phonetic Idiosyncrasies for Accent Recognition, Soft Computing Letters, 2021.
Please cite the above paper if you intent to use whole/part of the code. This code is only for academic and research purposes.
Spectrogram of an English sample.
All codes are written in python3
.
The following libraries should be installed before the execution of the codes.
- numpy:
pip install numpy
- matplotlib:
pip install matplotlib
- opencv-python 4.3.0.36:
pip install opencv-python
- cv2:
pip install cv2
- tqdm:
pip install tqdm
- pycm:
pip install pycm
The audio samples can be found here : https://www.kaggle.com/rtatman/speech-accent-archive. GitHub doesn't allow me to give you all the spectrogram files generated, so I decided to give you at max 100 of each accent as an example. If you need all of it, feel free to use the dataset of spectrograms present on my personal Kaggle profile : https://www.kaggle.com/pberjon/accent-data.
Each Python Notebook gives one contribution of the article (and one model I have used):
accentrecognitionsvm.ipynb
: the SVMaccentrecognitioncnn2.ipynb
: the 2-layer CNNaccentrecognitioncnn4.ipynb
: the 4-layer CNN