luosiallen/Diff-Foley
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
Python · Apache-2.0
Issues
- 1
How to get AudioSet-V2A?
#18 opened by BingliangLi - 0
can you release a huggingface demo?
#9 opened by ninjasaid2k - 4
Is the checkpoint of "Diff-Foley/diff_foley/modules/cond_stage/video_feat_encoder.py" essential??
#26 opened by HUIZ-A - 0
Why LDM with visual instead of audio features?
#29 opened by sivannavis - 1
- 5
TypeError: __init__() missing 1 required positional argument: 'first_stage_config'
#25 opened by Angelalilyer - 0
- 0
Have you ever planned to create docker images for better environment setup?
#21 opened by Danny-C-Auditore - 0
where is the code of training classifier?
#20 opened by MRG-DOT - 0
EPIC-Kitchens Models
#19 opened by aashishrai3799 - 3
Inquiry about the environment of Diff-Foley
#16 opened by YingJiang96 - 5
How to setup training?
#8 opened by aashishrai3799 - 2
Torch-lightning version
#15 opened by YoonjinXD - 0
- 0
Requirements file to help with replication
#12 opened by zoahmed-xyz - 0
- 1
Large batch size question
#7 opened by juliawilkins - 1
about evaluation
#5 opened by Yusiissy - 3
- 1
KeyError in LDM Model
#3 opened by aashishrai3799 - 3
About the generalization ability
#4 opened by auzxb - 2
About code release
#2 opened by VanderHua - 1
Questions about the pretrained params
#1 opened by auzxb