500-Hours-Italian-Conversational-Speech-Data-by-Mobile-Phone

Description

About 700 speakers participated in the recording, and conducted face-to-face communication in a natural way. They had free discussion on a number of given topics, with a wide range of fields; the voice was natural and fluent, in line with the actual dialogue scene. Text is transferred manually, with high accuracy.

For more details, please refer to the link: https://www.nexdata.ai/datasets/1178?source=Github

Format

16kHz, 16bit, uncompressed wav, mono channel;

Recording Environment

quiet indoor environment, without echo;

Recording content

dozens of topics are specified, and the speakers make dialogue under those topics while the recording is performed;

Demographics

About 700 people.

Annotation

annotating for the transcription text, speaker identification and gender

Device

Android mobile phone, iPhone;

Language

Italian

Application scenarios

speech recognition; voiceprint recognition;

Accuracy rate

the word accuracy rate is not less than 98%

Licensing Information

Commercial License

Nexdata-AI/500-Hours-Italian-Conversational-Speech-Data-by-Mobile-Phone