This project is just a simple python3 version of speaker-recognition
For the convenience, removed a lot of features I didn't use.
- Neither use MFCC implementation of bob nor implement that myself. Use the python_speech_features instead.
- Remove the GUI and you can only use the command line to train and predict the model.
- Replace the function and class in sklearn which will be removed in the later version.
usage: speaker-recognition.py [-h] -t TASK -i INPUT -m MODEL
Speaker Recognition Command Line Tool
optional arguments:
-h, --help show this help message and exit
-t TASK, --task TASK Task to do. Either "enroll" or "predict"
-i INPUT, --input INPUT
Input Files(to predict) or Directories(to enroll)
-m MODEL, --model MODEL
Model file to save(in enroll) or use(in predict)
Wav files in each input directory will be labeled as the basename of the directory.
Note that wildcard inputs should be *quoted*, and they will be sent to glob module.
Examples:
Train:
./speaker-recognition.py -t enroll -i "/tmp/person* ./mary" -m model.out
Predict:
./speaker-recognition.py -t predict -i "./*.wav" -m model.out