Spoken-Digit-Recognition

Input - speech signal, output - digit number

It contains :-

  1. Reading the dataset and preprocessing the data set.

  2. Training the LSTM with RAW data.

  3. Converting to spectrogram and Training the LSTM network

  4. Creating the augmented data and doing step 2 and 3 again.