This is a demo for SOTA vocal separation models. Upload an audio file and the model will separate the vocals from the background music.
Based on the result of MDX23, the current SOTA model is BS-RoFormer.
For comparison, you can also try the Mel-RoFormer model (a variant of BS-RoFormer) and the popular HTDemucs FT model.
- BS-RoFormer
- Mel-RoFormer
- HTDemucs FT
The models are trained by the UVR project.
The code of this app is available on GitHub, any contributions should go there. Hugging Face Space is force pushed by GitHub Actions.