huggingface/distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
Python, MIT
Issues
- 2
Distil-Whisper model sometimes fails with "index x is out of bounds for dimension 0 with size y" error
#148 opened by aleksandr-mokrov - 1
How to load a fine-tuned model for inference?
#145 opened by xinliu9451 - 0
Is there a way to run this on a browser?
#157 opened by FerLuisxd - 1
distil-whisper-turbo
#154 opened by simpthy - 0
Speculative Decoding: TypeError: list indices must be integers or slices, not tuple (Apple M1 MacOS Sonoma 14.6.1)
#153 opened by solitaryangler - 3
Repository Not Found for url
#136 opened by wailokkwok - 0
Using run distillation lead to a high .cache use
#151 opened by Gusreis7 - 0
How many hours for the French version
#146 opened by virtualmartire - 0
Cannot continue training for custom dataset
#144 opened by alicewith - 0
The fine-tuning issue regarding this project
#143 opened by lq0104 - 0
Problems in concatenate_dataset
#129 opened by George0828Zhang - 0
ZeroDivisionError: division by zero
#137 opened by wailokkwok - 1
Discrepancy on WER benchmark result in Tedlium dataset
#135 opened by GeeYangML - 6
Unable to reproduce results from the paper
#131 opened by GeeYangML - 3
Resuming training fails
#105 opened by hidoba - 0
question about when to apply WER threshold filtering strategy with concatenated audio
#127 opened by lq0104 - 1
Finetuning on which model?
#104 opened by RohitMidha23 - 2
Whisper-tiny support?
#84 opened by alicewith - 0
Quantize distil-whisper?
#113 opened by sujitvasanth - 0
[Issue] latest run_pseudo_labelling.py
#106 opened by ckcraig01 - 3
WER Filtering takes too long?
#80 opened by macabdul9 - 3
distil-small.en AttributeError
#90 opened by andrewjones0198 - 1
Errors running `run_long_form_eval.py`
#87 opened by guynich - 1
Why do we need to tokenized file_id?
#82 opened by macabdul9 - 1
large-v2 for english lost voice to text
#100 opened by machenme - 1
Pseudo-labelling librispeech_asr (train.360): KeyError `train-360` when not streaming.
#96 opened by guynich - 1
Cached English Common Voice dataset size.
#94 opened by guynich - 0
transcription results are inconsistent and timestamps are None type. Issue appears in the latest version of the transformers==4.38.1.
#91 opened by kranipa - 3
Loss Weight ablation experiment
#75 opened by JinYu1998 - 2
Cannot do pseudo-labelling due to some error
#83 opened by alicewith - 0
Best way to implement streaming application?
#89 opened by 9throok - 3
Training README triggers BuildConfig ValueError
#86 opened by guynich - 1
Avoid preprocessing dataset again?
#78 opened by marianbastiUNRN - 1
Is there the "distil-whisper/distil-medium" model?
#72 opened by bk111