41-Hours-Chinese-Young-Children-Speech-Data-by-Mobile-Phone-and-Microphone

Description

The data were recorded by 797 Chinese children aged 3 to 5, of whom 39% were children aged 5. The recording content conforms to the characteristics of children, mainly storybooks, children's songs, spoken language. Around 120 sentences for each speaker. It is simultaneously recorded by hi-fi microphone and cellphone. The vaild data are 41.8 hours. Texts are manually transcribed with high accuracy.

For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/76?source=Github

Format

16kHz/22.05kHz/44.1kHz, 16bit, uncompressed wav, mono channel

Recording Environment

quiet indoor environment, without echo

Recording Content

general category, children's songs, storybooks, human-machine interaction, numbers, letters

Population

797 people, 49% of which are female

Device

recorded by mobile phone and microphone; Android mobile phone and iPhone

Language

Mandarin

Transcription content

text, noise symbols

Application scenarios

speech recognition; voiceprint recognition

Licensing Information

Commercial License

Nexdata-AI/41-Hours-Chinese-Young-Children-Speech-Data-by-Mobile-Phone-and-Microphone