kkoutini/PaSST

Efficient Training of Audio Transformers with Patchout

PythonApache-2.0

Issues

Where is input normalization applied?
#49 opened 8 months ago by Antoine101
4
From ViT models to audio
#45 opened 9 months ago by Antoine101
7
Changing the depth of PASST.
#47 opened 9 months ago by Rishabh-S1899
1
EOF (End Of File) Error on num_workers>0
#48 opened 9 months ago by Rishabh-S1899
1
.net and .net_swa parameters in .ckpt file
#46 opened 9 months ago by Rishabh-S1899
1
time_new_pos_embed
#42 opened 9 months ago by Antoine101
6
Fixing weights for fine-tuning?
#44 opened 9 months ago by Antoine101
2
Which config can reproduce the results in paper?
#43 opened 9 months ago by diggerdu
1
Pre-trained models on ESC-50
#40 opened a year ago by Antoine101
3
can use on 8k audio ?
#41 opened a year ago by herbiel
1
Error when trying to pip install repo
#39 opened a year ago by Antoine101
2
RuntimeError: stft requires the return_complex parameter be given for real inputs
#38 opened a year ago by loukasilias
3
Getting started with a custom dataset
#33 opened 2 years ago by OhadCohen97
8
Inference on AudioSet
#37 opened a year ago by nandacv
3
test my own model
#36 opened a year ago by fuguanyu
1
音频事件检测
#34 opened a year ago by fuguanyu
0
setup.py
#28 opened 2 years ago by turian
0
Inference Issue
#31 opened 2 years ago by Jerry2001
2
difference of fine-tuning the pretrained models
#30 opened 2 years ago by nianniandoushabao
2
I have a problem. why convert wav to mp3?
#29 opened 2 years ago by zdj97
3
Could not solve for environment specs
#27 opened 2 years ago by turian
4
OpenMic fine-tuned model?
#26 opened 2 years ago by turian
2
Pretrained models config
#25 opened 2 years ago by dlthdus0611
3
FSD50K - validating on eval data
#24 opened 2 years ago by Ludvig-Joborn
5
ImportError: cannot import name 'F1' from 'torchmetrics' (/app/anaconda3/lib/python3.7/site-packages/torchmetrics/__init__.py)
#23 opened 2 years ago by aiXia121
1
Is it possible to install the passt with python=3.6?
#22 opened 2 years ago by Alibabade
2
RuntimeError: The size of tensor a (2055) must match the size of tensor b (99) at non-singleton dimension 3
#19 opened 3 years ago by 980202006
3
is `config.dyn_norm` enabled?
#21 opened 3 years ago by faroit
1
mismatch version of pytorch-lighting and sarced
#16 opened 3 years ago by Junglesl
15
The loop in the diagram
#18 opened 3 years ago by YangYangTaoTao
1
Installation issues
#17 opened 3 years ago by p4vlos
1
Fine tuning on novel dataset
#14 opened 3 years ago by beyondbeneath
4
Is it possible to use this project directly for a code example for instrument recognition?
#15 opened 3 years ago by 980202006
4
kaggle
#13 opened 3 years ago by woozi1122
2
Inference ESC-50 fine-tuned model
#12 opened 3 years ago by myatmyintzuthin
2
Wavmix for the ESC50 dataset
#11 opened 3 years ago by Jimmy2027
1
Changing tdim for pretrained model
#10 opened 3 years ago by ranjith1604
3
OpenMic2018
#9 opened 3 years ago by adey2021
1
The meaning of "swa"
#8 opened 3 years ago by xianyi11
1
Openmic2018
#7 opened 3 years ago by WangHelin1997
1
audio inference
#6 opened 3 years ago by dagongji10
3
Longer input?
#5 opened 3 years ago by zelaki
0
Binarizing linear predictions
#4 opened 3 years ago by anarsultani97
1
Evaluate my own model
#3 opened 3 years ago by WangHelin1997
2
Training Logs
#2 opened 3 years ago by WangHelin1997
3
No module named 'ba3l.ingredients'
#1 opened 3 years ago by kimsojeong1225
5