happylittlecat2333/Auffusion
Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"
Jupyter NotebookNOASSERTION
Issues
- 1
About pad_spec
#12 opened by Vincent2311 - 2
regarding the non-commercial license
#13 opened by doogyhatts - 2
Question about CLAP score evaluation
#11 opened by IFICL - 1
Can I control the duration of theText-guided style transfer's output audio?
#10 opened by hello-xiaow - 6
About pre-trained VAE
#9 opened by kaiw7 - 1
`AttributeError: 'NoneType' object has no attribute 'shape'` when giving negative_prompt
#8 opened by lingchuL - 3
Pipeline doesn't work with Diffusers=0.25.1
#6 opened by IFICL - 3
pt_to_numpy in auffusion_pipeline.py has 'staticmethod' object is not callable error
#4 opened by Harushii18 - 4
- 4
Key Differences with Riffusion?
#2 opened by IFICL