---
title: Vocal Separation SOTA
emoji: 🎤
colorFrom: red
colorTo: gray
sdk: gradio
sdk_version: 4.37.2
app_file: app.py
pinned: false
license: mit
---

# Vocal Separation SOTA

[![Open in Spaces](https://huggingface.co/datasets/huggingface/badges/resolve/main/open-in-hf-spaces-lg-dark.svg)](https://huggingface.co/spaces/JacobLinCool/vocal-separation)

This is a demo for SOTA vocal separation models. Upload an audio file and the model will separate the vocals from the background music.

Based on the result of [MDX23](https://www.aicrowd.com/challenges/sound-demixing-challenge-2023/problems/music-demixing-track-mdx-23/leaderboards), the current SOTA model is [BS-RoFormer](https://arxiv.org/abs/2309.02612).

For comparison, you can also try the Mel-RoFormer model (a variant of BS-RoFormer) and the popular HTDemucs FT model.

## Models

- BS-RoFormer
- Mel-RoFormer
- HTDemucs FT

> The models are trained by the [UVR project](https://github.com/Anjok07/ultimatevocalremovergui).

> The code of this app is available on [GitHub](https://github.com/JacobLinCool/vocal-separation), any contributions should go there. Hugging Face Space is force pushed by GitHub Actions.