common-voice/cv-dataset
Metadata and versioning details for the Common Voice dataset
JavaScriptMPL-2.0
Issues
- 1
- 0
A small request for column & field naming
#34 opened by HarikalarKutusu - 4
How many peoples in all dataset?
#33 opened by wntg - 6
Some mp3 files in cv corpus 4 are empty
#31 opened by yundaqwe - 1
Minor Bug in Text Corpus calculations
#30 opened by HarikalarKutusu - 2
FEATURE REQUEST: Please add `duration` as a metadata item that is included in the `*.tsv` files with a release
#28 opened by KathyReid - 0
有没有中文native-speaker能帮忙解释下各个*.tsv的意思
#29 opened by Liujingxiu23 - 1
FEATURE REQUEST: Make the `.tsv` files that are part of a downloaded dataset available separately
#26 opened by KathyReid - 0
- 3
Error: Version 15 summary data does not contain nested objects for splits (age, gender) and buckets (validation)
#25 opened by KathyReid - 3
need label about sample clean or noisy
#23 opened by JohnHerry - 1
Wrong checksums for Common Voice Corpus 13.0
#21 opened by paniedziela - 3
- 2
- 3
Feature request: CSV
#14 opened by bulvara - 3
Wrong duration value in ar v10.0
#17 opened by HarikalarKutusu - 0
- 0
- 3
Feature request: Summary data of each language including rows with metadata, gender, age, accent distribution
#7 opened by KathyReid - 3
- 0
Feature request: list sampling rates in dataset, download dataset given sampling rate
#10 opened by rafaelvalle - 0
Add CV 8.0 metadata
#8 opened by JRMeyer - 4
Download format is .tar instead of .tar.gz
#2 opened by HaritYadav - 1
.tsv files not found
#4 opened by zuther77 - 1
Script to download all the datasets
#5 opened by harrygcoppock - 5
Fix stereo files to mono
#1 opened by Mte90