CASIA Emotion Speech Dataset

  • LANG: mandarin
  • Number of utterance: 9594
  • Number of speaker: 2 male and 2 female
  • Duration of dataset:
  • Sampling rate: 16kHz

CSTR VCTK Corpuslink

  • LANG: english
  • Duration of dataset: 44 hours
  • Number of speaker: 109