The dataset contains 231 hours of speech recorded by 406 speakers (from France, Canada, and Africa). The recordings were made in a quiet environment and are rich in content, covering fields such as economics, entertainment, news, and spoken language. All texts are manually transcribed. The sentence accuracy rate is 95%.
For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/114?source=Github
16kHz, 16bit, uncompressed wav, mono channel
quiet indoor environment, without echo
economy, entertainment, news, oral language, numbers, letters
406 people from France, Canada, Rwanda, and other countries, 52% of whom are male
Android mobile phone, iPhone
French
text, time point of speech data, 5 noise symbols, special identifiers
95% (the accuracy rate of noise symbols and other identifiers is not included)
speech recognition, voiceprint recognition
Commercial License
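
The audio format listed above (16kHz, 16bit, uncompressed wav, mono channel) can be checked on a downloaded file before it is fed into a training pipeline. The following is a minimal sketch using only Python's standard `wave` module; the function name `check_wav_format` and the file name `sample.wav` are illustrative assumptions, not part of the dataset or its tooling.

```python
import wave

# Expected values, taken from the format line of this dataset description.
EXPECTED_SAMPLE_RATE = 16000  # 16 kHz
EXPECTED_SAMPLE_WIDTH = 2     # 16 bit = 2 bytes per sample
EXPECTED_CHANNELS = 1         # mono


def check_wav_format(path):
    """Return a list of mismatches; an empty list means the file conforms.

    Hypothetical helper for illustration only.
    """
    problems = []
    try:
        # The stdlib wave module reads uncompressed PCM WAV files and
        # raises wave.Error for files it cannot parse.
        with wave.open(path, "rb") as wav:
            if wav.getframerate() != EXPECTED_SAMPLE_RATE:
                problems.append(f"sample rate is {wav.getframerate()} Hz")
            if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH:
                problems.append(f"sample width is {wav.getsampwidth() * 8} bit")
            if wav.getnchannels() != EXPECTED_CHANNELS:
                problems.append(f"channel count is {wav.getnchannels()}")
    except wave.Error as exc:
        problems.append(f"not a readable PCM wav file: {exc}")
    return problems


if __name__ == "__main__":
    import sys

    # Usage: python check_wav.py sample.wav
    for wav_path in sys.argv[1:]:
        issues = check_wav_format(wav_path)
        print(wav_path, "OK" if not issues else "; ".join(issues))
```

An empty result means the file matches the stated sample rate, bit depth, and channel count. The check says nothing about recording quality or transcription accuracy.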