1730 Sichuan native speakers participated in the recording and face-to-face free talking in a natural way in wide fields without the topic specified. It is natural and fluency in speech, and in line with the actual dialogue scene. We transcribed the speech into text manually to ensure high accuracy.
For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/1065?source=Github
16kHz, 16bit, uncompressed wav, mono channel
quiet indoor environment, without echo
no topic is specified, and the speakers make dialogue while the recording is performed
1,730 people, 74% of which are female; 88% of 1,730 people are not more than 25 years old; people are from Sichuan or Chongqing
annotating for the transcription text, speaker identification and gender
Android mobile phone, iPhone
Sichuan dialect
speech recognition, voiceprint recognition.
Commercial License