yeyupiaoling/MASR
Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。
PythonApache-2.0
Issues
- 2
masr界面识别显存不释放
#75 opened by upupbo - 4
这个是不是不能用于cuda为11.4的
#74 opened by king-hnu - 2
windows系统多卡训练失败
#76 opened by yjyz1011 - 1
Aishell-1测试集问题
#73 opened by CSLJingyu - 1
- 3
train time
#71 opened by iakRulan - 1
下载了squeezeformer的best model之后导出报错
#70 opened by DannyWang920 - 1
运行train.py报错
#69 opened by happywch - 4
运行train.py时报错
#66 opened by Vickywhy - 2
RuntimeError: PytorchStreamReader failed locating file constants.pkl: file not found
#62 opened by dianxin556 - 5
在train.py运行开始时发生如下报错
#65 opened by Isaiah-pq - 3
RuntimeError: `lengths` array must be sorted in decreasing order when `enforce_sorted` is True. You can pass `enforce_sorted=False` to pack_padded_sequence and/or pack_sequence to sidestep this requirement if you do not need ONNX exportability
#64 opened by TszSimLaw - 1
- 1
在数据库WenetSpeech上预训练的模型conformere免费下载
#60 opened by eeric - 4
RuntimeError: PytorchStreamReader failed locating file constants.pkl: file not found
#57 opened by eeric - 0
缺失vocabulary文件
#61 opened by DrewdropLife - 2
No such file or directory: 'dataset/manifest.test'
#59 opened by eeric - 3
AIShell (179小时) 的预训练模型,是哪个版本torch训练的
#58 opened by eeric - 67
online 和offline自己炼的话 相同条件下是不是offline效果好点?
#55 opened by 2651084156 - 1
no module named "torch.inference"
#56 opened by hxhcreate - 2
运行tune.py报错,提示没有mean_std.npz文件
#54 opened by jiangduwang - 12
- 4
模型训练GPU利用率
#30 opened by jcl-gx - 7
实时语音识别,报hidden层size不匹配
#42 opened by moneypi - 2
数据准备部分怎么生成数据列表文件,存在dataset/annotation/目录下
#53 opened by jiangduwang - 8
- 1
选择音频处理方式前向计算维度错误
#49 opened by jackjieliu - 1
没有python_speech_features包
#48 opened by jackjieliu - 2
About dataset
#47 opened by qzhsdu - 2
数据集标注信息
#41 opened by leishenzhupi - 1
默认类型应该改为float
#44 opened by youshengithub - 7
是spectrogram还是mfcc特征呢?
#40 opened by lywhz - 1
- 2
- 1
使用预训练数聚集进行finetune 大概需要多少数据呢,训练多少轮次可以不降低本身精度呢
#37 opened by 827379852 - 2
predict.py中import paddle应该注释掉,否则会报错
#35 opened by leishenzhupi - 6
直接录制语音和上传语音文件,识别效果的差异
#36 opened by bird7code - 1
用自己的数据集报错,格式检查没问题
#34 opened by XWT999 - 5
加入自己的数据训练,预测同样的数据得分却不高
#31 opened by bird7code - 4
WenetSpeech数据集的训练参数
#33 opened by bird7code - 5
流式语音识别的功能貌似没有看到,请问现今实现了么?
#29 opened by RabbitBoss - 1
能否支持在线识别,边说便识别
#26 opened by swpucwf - 0
WenetSpeech数据集下载出现404错误
#32 opened by bird7code - 10
- 4
- 1
Android预计什么时候支持
#24 opened by wy676579037 - 2
infer_path.py 消耗时间:3432ms 是不是数据集的关系
#23 opened by cgisky - 1
大佬,流式的是怎么个思路啊
#22 opened by wyt1234 - 5
大佬,cpu能跑嘛
#20 opened by hy1079503225 - 1
大佬,cpu能跑嘛
#21 opened by hy1079503225