This application demonstrates a simple Voice Gender Detection. Voice gender identification relies on three important steps.
- Extracting from the training set MFCC features (13 usually) for each gender
- Train a GMM on those features
- In prediction, compute the likelihood of each gender using the trained GMM, and pick the most likely gender
First, clone the project or download it locally. Then, install the requirements using:
pip install -r requirements.txt
The notebook contains the training code and the generation of the male.gmm
and female.gmm
files. You can just run the code of the notebook to retrain them.
To run the application locally, simply run:
streamlit run app.py
The accuracy in test remains to improve (65%). When a user plays with his or her voice and tries to immitate the other gender, the GMM gets fooled and predicts the wrong gender. This is also due to the training data that it has seen so far which were extracted from AudioSet and Youtube.