yeyupiaoling/Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

CApache-2.0

Issues

Push to Hub Support
#78 opened a month ago by sanchit-gandhi
0
finetune whisper-large-v3 的时候，中间模型解码会出现乱码和token多次重复的情况
#77 opened 2 months ago by Meeny2018
0
whisper全量微调相关问题
#76 opened 2 months ago by wangyarududu
0
tools/create_wenetspeech_data.py报错
#75 opened 2 months ago by Larry-Ye
0
是否支持mp4 视频
#74 opened 2 months ago by lizhanyang505
0
Belle-whisper-large-v3-zh 没法输出标点符号
#73 opened 2 months ago by jianchaozhuang
0
微调方言，数据标注为chinese还是新建一个方言的类型呢？
#72 opened 3 months ago by veritastry
1
NaN or Inf found in input tensor
#70 opened 3 months ago by GoldenLinlin
2
建議您整理一下 requirements.txt 及 readme.md
#71 opened 3 months ago by bardenthenry
1
使用large-v2-finetune，高機率出現重複內容…
#67 opened 3 months ago by zero-zen
2
能結合 pyannote 嗎?
#62 opened 3 months ago by HKLee2040
1
如何导出pt格式的模型？
#69 opened 3 months ago by jinchao123
1
训练出的模型如何导出npz格式
#68 opened 3 months ago by jackwenshann
1
evaluate运行时出现问题
#56 opened 4 months ago by yfwang214
6
训练到eval值的步数时报错：AttributeError: 'NoneType' object has no attribute 'get'
#60 opened 3 months ago by sangzige
1
ubuntu20.04环境问题
#66 opened 3 months ago by happylsr
8
transcribe时如何产生时间戳？
#64 opened 3 months ago by Koyonyong
2
微调模型保存时需要连接huggingface
#63 opened 3 months ago by qiangduscu
1
模型微调后出现乱码
#59 opened 3 months ago by hcguoO0
1
能提供fine-tune 模型的原始 checkpoint (pt) 吗？
#65 opened 3 months ago by walletiger
1
训练发生异常
#28 opened 10 months ago by ILG2021
7
请问如何导出onnx
#61 opened 3 months ago by drilistbox
1
请问加速推理可以离线使用吗
#57 opened 3 months ago by 20246688
1
[CONTRIBUTION] Speech dataset Generator
#58 opened 4 months ago by davidmartinrius
0
用BELLE-2/Belle-whisper-large-v2-zh识别中文音频，效果还不如Systran/faster-whisper-large-v2？
#54 opened 4 months ago by drilistbox
2
可以使用initial_prompt做微调吗
#53 opened 4 months ago by v-yunbin
1
使用 ct2 轉換後掉辨識率
#52 opened 4 months ago by ken19980727
2
可以单独导出我们微调模型训练出的数据吗
#50 opened 5 months ago by jackwenshann
2
可以微调新的语种吗？
#48 opened 5 months ago by kli017
1
WhisperForConditionalGeneration 與 AutoModelForSpeechSeq2Seq 差異
#51 opened 5 months ago by ken19980727
1
whisper large v3 Fine-Tune 後變得不太能辨識語音
#43 opened 6 months ago by bardenthenry
28
123
#49 opened 6 months ago by 20246688
2
如何转换V3版本
#47 opened 6 months ago by xyx361100238
4
資料格式的 language 設定
#46 opened 6 months ago by ken19980727
1
config.json文件与huggingface上的config.json不一样
#45 opened 6 months ago by hahazei
1
微调时的奇怪问题，训练集变大之后，准确度反而下降了
#44 opened 6 months ago by ILG2021
5
微调在WhisperProcessor.from_pretrained调用时就报错
#42 opened 6 months ago by lichq5
5
2卡训练速度比单卡训练快很多
#40 opened 6 months ago by pcqiao0
15
正常数据和空数据一起训练的格式
#39 opened 6 months ago by xyx361100238
9
训练过程占用显存过高的问题
#30 opened 6 months ago by xyx361100238
8
如何随机化模型参数，从头开始训练
#41 opened 7 months ago by xyx361100238
2
训练稳定后的loss，大家一般都是多少啊
#38 opened 7 months ago by zouzoutingting
1
一般需要多少数据，可以有不错的效果呢
#33 opened 7 months ago by zouzoutingting
1
关于DataCollatorSpeechSeq2SeqWithPadding的一处问题
#32 opened 7 months ago by ILG2021
4
Keyword arguments {'sampling_rate': 16000} not recognized.
#35 opened 8 months ago by ken19980727
4
双通道的数据，还是单通道
#36 opened 8 months ago by zouzoutingting
0
训练数据里，需要提前把标点符号去掉吗
#37 opened 8 months ago by zouzoutingting
0
"无语音数据训练"是什么训练方式啊
#31 opened 8 months ago by zouzoutingting
1
lora微调large V2版本，需要多大的显存，
#34 opened 8 months ago by zouzoutingting
1
使用lora微调时遇到的奇怪问题
#29 opened 9 months ago by ILG2021
2