happylittlecat2333/Auffusion

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

Jupyter NotebookNOASSERTION

Issues

About pad_spec
#12 opened 3 months ago by Vincent2311
1
regarding the non-commercial license
#13 opened 3 months ago by doogyhatts
2
Question about CLAP score evaluation
#11 opened 8 months ago by IFICL
2
Can I control the duration of theText-guided style transfer's output audio?
#10 opened 8 months ago by hello-xiaow
1
About pre-trained VAE
#9 opened 8 months ago by kaiw7
6
`AttributeError: 'NoneType' object has no attribute 'shape'` when giving negative_prompt
#8 opened 8 months ago by lingchuL
1
Pipeline doesn't work with Diffusers=0.25.1
#6 opened 9 months ago by IFICL
3
pt_to_numpy in auffusion_pipeline.py has 'staticmethod' object is not callable error
#4 opened 10 months ago by Harushii18
3
the code for the audio-to-audio generation
#5 opened 10 months ago by hello-xiaow
4
Key Differences with Riffusion?
#2 opened 10 months ago by IFICL
4