Accent Trainer

Accent Trainer is a Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFCC), and accuracy. The accent samples are generated with Amazon Polly, and accuracy is analysed using the Bing Speech API's speech-to-text.
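
To illustrate the voice comparison, here is a minimal pure-Python sketch of dynamic time warping (DTW) over sequences of feature vectors. It is not the project's code: the app relies on cydtw and python_speech_features, while the function name dtw_distance and the toy two-dimensional "frames" below are assumptions made for illustration. Real MFCC frames would have more coefficients per frame.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def dtw_distance(a, b):
    """Accumulated DTW cost between two sequences of feature vectors.

    Hypothetical helper for illustration; classic O(len(a) * len(b))
    dynamic programme with no window constraint.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] is the cheapest alignment of a[:i] with b[:j].
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = euclidean(a[i - 1], b[j - 1])
            cost[i][j] = d + min(
                cost[i - 1][j],      # a[i-1] also matches an earlier frame of b
                cost[i][j - 1],      # b[j-1] also matches an earlier frame of a
                cost[i - 1][j - 1],  # both sequences advance together
            )
    return cost[n][m]


# Toy stand-ins for per-frame feature vectors (assumed data).
model = [[0.0, 1.0], [1.0, 2.0], [2.0, 3.0]]
slower = [[0.0, 1.0], [0.0, 1.0], [1.0, 2.0], [2.0, 3.0], [2.0, 3.0]]
different = [[5.0, 5.0], [6.0, 6.0], [7.0, 7.0]]

print(dtw_distance(model, slower))     # same content at a slower pace
print(dtw_distance(model, different))  # different content
```

Because DTW stretches one sequence against the other, the slower copy of the same frames aligns at no cost. This is presumably why speed is scored as a separate component.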

Performance

The distance results were tested against three kinds of sample: a model (the same Polly voice and text), different accents (a different Polly voice with the same text), and negative examples (a different Polly voice with different text). It performed as expected: the model scored 100%, similar accents scored higher than dissimilar ones (e.g. US English rated more highly against US English than against a Portuguese accent), and negative examples scored the worst.

Install

Install Anaconda for Python 3.6. If space is a problem, you can also use pip or Miniconda to install the dependencies.
git clone the repository and cd into it.
Install the remaining dependencies, optionally inside a virtual environment, using either pip or conda:
1. conda install -c conda-forge librosa
2. pip install pysoundfile
3. pip install SpeechRecognition
4. pip install python_speech_features
5. pip install cydtw
6. pip install Flask-WTF
Register for Amazon Web Services, install the CLI and configure it. You might also need to pip install boto3.
Register for Microsoft Azure and get a Bing Speech API key. Insert it into BING_KEY in functions.py.
Write your own secret key in app.secret_key in app.py.
Modify the grade calculations under compare() and compare_json(). These are arbitrary, so you may want to substitute your own formula.
Run python app.py to start the server.
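
Since the grade calculations are meant to be replaced, here is one possible shape for a custom formula. It is a sketch under stated assumptions, not what compare() or compare_json() actually do. The names distance_to_score and combine_scores, the worst_distance cutoff, and the weights are all hypothetical, and the cutoff would need tuning against your own recordings.

```python
def distance_to_score(distance, worst_distance):
    """Map a distance to a 0-100 score (hypothetical helper).

    A distance of 0 scores 100; anything at or beyond worst_distance
    scores 0; values in between fall off linearly.
    """
    if worst_distance <= 0:
        raise ValueError("worst_distance must be positive")
    fraction = min(max(distance / worst_distance, 0.0), 1.0)
    return 100.0 * (1.0 - fraction)


def combine_scores(speed, voice, accuracy, weights=(0.2, 0.5, 0.3)):
    """Weighted average of the three component scores (assumed weights)."""
    w_speed, w_voice, w_accuracy = weights
    total = w_speed + w_voice + w_accuracy
    return (w_speed * speed + w_voice * voice + w_accuracy * accuracy) / total


# Assumed example values: a voice distance of 25 with a cutoff of 50.
voice_score = distance_to_score(25.0, worst_distance=50.0)
print(combine_scores(speed=80.0, voice=voice_score, accuracy=90.0))
```

A linear falloff keeps an exact match at 100%, in line with the model result described under Performance. A steeper or non-linear curve may suit real recordings better.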

Contribute

Feel free to post issues and make pull requests.

Deploy

This is a skeleton for a more fully developed server-side solution. Feel free to contact me via my website.
