738-Hours-Uyghur-Speech-Data-by-Mobile-Phone

Description

It collects 2,058 people from the Uighur community, with a balanced ratio of men and women. The recording contents are 300,000 Uighur spoken sentences, and the recording environment is quiet indoor. All sentences were manually and accurately transcribed and annotated with noise signs.

For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/46?source=Github

Format

16kHz, 16bit, uncompressed wav, mono channel

Recording Environment

quiet indoor environment, without echo

Recording Content

oral sentences

Speaker

2,508 people, 53% of which are female

Device

Android mobile phone, iPhone

Language

Uyghur

Transcription content

text

Accuracy rate

95%

Application scenarios

speech recognition, voiceprint recognition

Licensing Information

Commercial License

Nexdata-AI/738-Hours-Uyghur-Speech-Data-by-Mobile-Phone