haoheliu/AudioLDM-training-finetuning
AudioLDM training, finetuning, evaluation and inference.
PythonMIT
Issues
- 6
- 0
cannot download checkpoints
#49 opened by Sensorymagicia - 1
Requirements
#48 opened by mateusztobiasz - 0
48 kHz HiFiGAN training data
#47 opened by philgzl - 2
About the released code
#45 opened by CJ416 - 2
- 1
I can't see the printed logs during training
#44 opened by Ask-sola - 1
GPT2 training
#5 opened by ivcy0928 - 3
Fatal issue occurred during fine-tuning.
#43 opened by Ask-sola - 0
Question on reproduction of AudioLDM training
#42 opened by aFewThings - 0
Training weights and pipeline LDM2
#40 opened by Tortoise17 - 0
clap architecture
#39 opened by hananetli - 1
- 0
Pretrained VAE License
#37 opened by VedantKalbag - 2
training audioldm1 and not audioldm2
#36 opened by kmaro2345 - 5
Training Code for AudioLDM2
#24 opened by Tortoise17 - 2
error in saved checkpoints
#34 opened by userpspsname12 - 0
infer trained ldm
#33 opened by netagl - 6
Training on our own dataset.
#2 opened by Praniyendeev - 5
- 0
train AudioLDM from beginning to end
#32 opened by Linghuxc - 0
Cosine scheduler gives much worse results
#31 opened by MoayedHajiAli - 0
Purpose of extra_sa_layer
#30 opened by nikifori - 0
AudioLDM-M-full vs AudioLDM-L-full
#29 opened by nikifori - 0
Finetune on a hugging face checkpoint
#28 opened by yangqing-20 - 0
Generating longer audio sequences.
#27 opened by Praniyendeev - 0
Some question regarding the Training
#26 opened by Respaired - 0
class_labels_indices.csv generation
#25 opened by lamHoussam - 0
Embed mode for AudioLDM model
#23 opened by NZqian - 0
Inference is very slow
#22 opened by DucHuyAnalyst - 1
How to define time of an audio
#13 opened by huutuongtu - 7
A question about training with transcription
#19 opened by wangjs9 - 1
- 2
question about the metadata
#18 opened by holehole5566 - 1
- 1
Support LoRA finetuning in the future?
#15 opened by ZhengJun-AI - 0
- 3
Inference code
#4 opened by anhquannguyen21 - 2
How to inference model ? Please
#7 opened by manhdoan291