Issues
- 4
Where is input normalization applied?
#49 opened by Antoine101 - 7
From ViT models to audio
#45 opened by Antoine101 - 1
Changing the depth of PASST.
#47 opened by Rishabh-S1899 - 1
EOF (End Of File) Error on num_workers>0
#48 opened by Rishabh-S1899 - 1
.net and .net_swa parameters in .ckpt file
#46 opened by Rishabh-S1899 - 6
time_new_pos_embed
#42 opened by Antoine101 - 2
Fixing weights for fine-tuning?
#44 opened by Antoine101 - 1
Which config can reproduce the results in paper?
#43 opened by diggerdu - 3
Pre-trained models on ESC-50
#40 opened by Antoine101 - 1
can use on 8k audio ?
#41 opened by herbiel - 2
Error when trying to pip install repo
#39 opened by Antoine101 - 3
RuntimeError: stft requires the return_complex parameter be given for real inputs
#38 opened by loukasilias - 8
Getting started with a custom dataset
#33 opened by OhadCohen97 - 3
Inference on AudioSet
#37 opened by nandacv - 1
test my own model
#36 opened by fuguanyu - 0
- 0
- 2
Inference Issue
#31 opened by Jerry2001 - 2
- 3
I have a problem. why convert wav to mp3?
#29 opened by zdj97 - 4
Could not solve for environment specs
#27 opened by turian - 2
OpenMic fine-tuned model?
#26 opened by turian - 3
Pretrained models config
#25 opened by dlthdus0611 - 5
FSD50K - validating on eval data
#24 opened by Ludvig-Joborn - 1
ImportError: cannot import name 'F1' from 'torchmetrics' (/app/anaconda3/lib/python3.7/site-packages/torchmetrics/__init__.py)
#23 opened by aiXia121 - 2
- 3
RuntimeError: The size of tensor a (2055) must match the size of tensor b (99) at non-singleton dimension 3
#19 opened by 980202006 - 1
is `config.dyn_norm` enabled?
#21 opened by faroit - 15
mismatch version of pytorch-lighting and sarced
#16 opened by Junglesl - 1
The loop in the diagram
#18 opened by YangYangTaoTao - 1
Installation issues
#17 opened by p4vlos - 4
Fine tuning on novel dataset
#14 opened by beyondbeneath - 4
Is it possible to use this project directly for a code example for instrument recognition?
#15 opened by 980202006 - 2
- 2
Inference ESC-50 fine-tuned model
#12 opened by myatmyintzuthin - 1
Wavmix for the ESC50 dataset
#11 opened by Jimmy2027 - 3
Changing tdim for pretrained model
#10 opened by ranjith1604 - 1
OpenMic2018
#9 opened by adey2021 - 1
The meaning of "swa"
#8 opened by xianyi11 - 1
Openmic2018
#7 opened by WangHelin1997 - 3
audio inference
#6 opened by dagongji10 - 0
Longer input?
#5 opened by zelaki - 1
Binarizing linear predictions
#4 opened by anarsultani97 - 2
Evaluate my own model
#3 opened by WangHelin1997 - 3
Training Logs
#2 opened by WangHelin1997 - 5
No module named 'ba3l.ingredients'
#1 opened by kimsojeong1225