This project use mfcc feature extractor and Hidden Markove Model classification algorithm to recognition 0 - 9 digit of Kaggle dataset.
- Dataset of Kaggle : https://www.kaggle.com/divyanshu99/spoken-digit-dataset
- Hidden Markove Model
- Mel-frequency Cepstrum
This code is written in python. To use it you will need:
- python3
- hmmlearn
- librosa.feature
- numpy
- librosa
- random Use pip to install any missing dependencies
- Split train and test data
- Feature extract each audio using mfcc
- Transpose each audio signal matrix
- Vstack each transpose audio signal matrix
- Create Hidden Markov Modle
- Test Audio signal and predict them
94 %
Run python SDR.py