/simpleASR

Primary language: Python. License: MIT.

Simple ASR model

  1. Extract features (like MFCC) from the acoustic signal
  2. Feed an ML model (classification or clustering) with the features
  3. Recognize the speech input
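The three steps above can be sketched end to end in plain Python. This is only an illustration under loud assumptions, not the repository's implementation: mean log-energy and zero-crossing rate stand in for MFCC features, a nearest-centroid classifier stands in for the unspecified ML model, and synthetic noisy tones stand in for recordings. The names `frame_features`, `train_centroids`, `recognize`, and `tone` are hypothetical.

```python
import math
import random


def frame_features(signal, frame_len=200):
    """Step 1 (stand-in for MFCC): mean log-energy and mean zero-crossing rate."""
    energies, zcrs = [], []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        energies.append(math.log(energy + 1e-10))  # small offset avoids log(0)
        crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
        zcrs.append(crossings / (frame_len - 1))
    return [sum(energies) / len(energies), sum(zcrs) / len(zcrs)]


def train_centroids(examples):
    """Step 2 (assumed model): average the feature vectors of each label."""
    sums, counts = {}, {}
    for features, label in examples:
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [v / counts[label] for v in acc] for label, acc in sums.items()}


def recognize(features, centroids):
    """Step 3: return the label whose centroid is nearest (Euclidean distance)."""
    return min(centroids, key=lambda label: math.dist(features, centroids[label]))


def tone(freq_hz, rng, sample_rate=8000, duration_s=0.5):
    """Synthetic stand-in for a recording: a sine wave with Gaussian noise."""
    n = int(sample_rate * duration_s)
    return [math.sin(2 * math.pi * freq_hz * t / sample_rate) + rng.gauss(0, 0.05)
            for t in range(n)]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
training = [(frame_features(tone(freq, rng)), label)
            for freq, label in [(200, "low"), (220, "low"),
                                (1500, "high"), (1600, "high")]]
centroids = train_centroids(training)
prediction = recognize(frame_features(tone(1550, rng)), centroids)
print(prediction)
```

In a real system the two features would likely be replaced by MFCC vectors and the centroids by a trained classifier or clustering model. The features here are also left unscaled, which only works because the zero-crossing rate alone separates the two synthetic classes.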

Feature extraction

Which features are used?

ML model

Which model is used?