harvard-edge/multilingual_kws
Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus
Jupyter Notebook
Issues
- 3
- 1
dataset of multilingual_context_73_0.8011
#42 opened by farzadhallaji - 3
- 0
Reproducing paper results
#39 opened by sathibault - 0
- 0
ERROR: Cannot find key when Running docker
#37 opened by manarsaaldossari - 5
Using multilingual_kws with microphone streaming
#34 opened by wesbz - 0
- 0
add version info to each tarball
#31 opened by mmaz - 1
GCS transfer
#28 opened by mmaz - 2
is Mozilla SWTS in the dataset?
#25 opened by mmaz - 0
Re-encode mp3/opus clips to exactly 1s
#8 opened by mmaz - 4
First time user
#3 opened by El-Yazid - 0
- 0
- 0
Filter out NaNs from Common Voice tsvs, distinguish between intentional "nan" in language vocabulary
#9 opened by mmaz - 3
found duplicates at __2 and __3 etc
#20 opened by mmaz - 1
Lithuanian clips not validated
#11 opened by Sharad24 - 1
Arabic word formatting
#29 opened by mmaz - 1
- 2
- 1
words greater than 2^3 are probably > 1s
#22 opened by mmaz - 1
generate a sibling dataset with speech context
#15 opened by mmaz - 1
Some percentage of wavs (~3%) are below 1s according to soxi, others can't be opened
#10 opened by mmaz - 0
Add TFDS integration/flow
#24 opened by colbybanbury - 0
Bug: Some languages (basque, polish) seem to have a higher total length duration than in original common Voice
#13 opened by Sharad24 - 0
- 1
- 0
- 0
Re-creating alignments for Common Voice 7
#12 opened by Sharad24 - 0
check for 16KHz in AudioDataset
#7 opened by mmaz - 1
is there the inference.py?
#4 opened by huacilang - 2
word counts
#1 opened by mmaz