35-Hours-Pinyin-Annotation-Speech-Data-of-Audio-Book-Text

Description

Audiobook annotated pinyin audio data, with duration of 35 hours; 5 speakers are recorded including 3 males and 2 females; Chinese characters and pinyin are annotated, including the tone of pinyin; this data set can be used for automatic speech recognition, machine translation, and voiceprint recognition.

For more details, please refer to the link: https://www.nexdata.ai/datasets/speechrecog/243?source=Github

Format

44.1kHz, 16bit, uncompressed wav, mono channel

Environment

Relatively quiet environment

Recording Content

Audio books, including five categories like beautiful essays, novel, logical thinking, children's story, and Twenty Years in Late Qing Dynasty.

People

5 people in total and 3 males and 2 females

Language

Mandarin

Application Scenario

Voice Recognition, Voice Print Recognition

Annotation Feature

Annotating audio data with Chinese and Pinyin.

Licensing Information

Commercial License

Nexdata-AI/35-Hours-Pinyin-Annotation-Speech-Data-of-Audio-Book-Text