150-People-Chinese-Mandarin-Average-Tone-Speech-Synthesis-Corpus-Customer-Service

Description

150 People - Chinese Mandarin Average Tone Speech Synthesis Corpus-Customer Service. It is recorded by Chinese native speakers,customer service text, and the syllables, phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

For more details, please refer to the link: https://www.nexdata.ai/datasets/tts/1100?source=Github

Format

48,000Hz, 16bit, uncompressed wav, mono channel;

Recording environment

professional recording studio;

Recording content

customer service text, and the syllables, phonemes and tones are balanced;

Speaker

150 speakers totally, with 50% male and 50% female;

Device

microphone;

Language

Mandarin;

Annotation

word and Pinyin transcription, four-level prosodic boundary annotation;

Application scenarios

speech synthesis.

Licensing Information

Commercial License

Nexdata-AI/150-People-Chinese-Mandarin-Average-Tone-Speech-Synthesis-Corpus-Customer-Service