/SpeechEmotionRecognition

Recognize the emotion in the speech using various machine learning and deep learning models.

Primary LanguageJupyter Notebook

Speech Emotion Recognition

MainImage.JPG

Problem Statement:

Classify the emotion in the audio speech file using various machine learning and deep learning models.

Dataset: RAVDESS

  • To do the analysis, I’ve explored datasets containing audio and video files by professional actors in different emotional sequences (happy, angry, sad, surprised, frightened, neutral and calm).
  • Since we are primarily focusing on processing the speech, the audio(WAV format) has been extracted from the video files(MP4 format). Then, the speech features such as, Mel Frequency Cepstrum Coefficients (MFCC) has been generated from the speech.

Metrics after applying CNN (Convolutional Neural Network) to the dataset:

ClassificationReport.JPG

ModelAccuracy.JPG

ModelLoss.JPG

ConfusionMatrix.JPG

ModelSummary.JPG