500-Hours-Italian-Conversational-Speech-Data-by-Mobile-Phone
Description
About 700 speakers participated in the recording, and conducted face-to-face communication in a natural way. They had free discussion on a number of given topics, with a wide range of fields; the voice was natural and fluent, in line with the actual dialogue scene. Text is transferred manually, with high accuracy.
For more details, please refer to the link: https://www.nexdata.ai/datasets/1178?source=Github
Format
16kHz, 16bit, uncompressed wav, mono channel;
Recording Environment
quiet indoor environment, without echo;
Recording content
dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed;
Demographics
About 700 people.
Annotation
annotating for the transcription text, speaker identification and gender
Device
Android mobile phone, iPhone;
Language
Italian
Application scenarios
speech recognition; voiceprint recognition;
Accuracy rate
the word accuracy rate is not less than 98%
Licensing Information
Commercial License