# VampNet This repository contains recipes for training generative music models on top of the Lyrebird Audio Codec. # Setting up install AudioTools ```bash git clone https://github.com/descriptinc/audiotools.git pip install -e ./audiotools ``` install the Descript Audio Codec. ```bash git clone https://github.com/descriptinc/descript-audio-codec.git pip install -e ./descript-audio-codec ``` install VampNet ```bash git clone https://github.com/hugofloresgarcia/vampnet.git pip install -e ./vampnet ``` ## A note on argbind This repository relies on [argbind](https://github.com/pseeth/argbind) to manage CLIs and config files. Config files are stored in the `conf/` folder. ## Getting the Pretrained Models Download the pretrained models from [this link](https://drive.google.com/file/d/1ZIBMJMt8QRE8MYYGjg4lH7v7BLbZneq2/view?usp=sharing). Then, extract the models to the `models/` folder. # Usage First, you'll want to set up your environment ```bash source ./env/env.sh ``` ## Staging a Run Staging a run makes a copy of all the git-tracked files in the codebase and saves them to a folder for reproducibility. You can then run the training script from the staged folder. ``` stage --name my_run --run_dir /path/to/staging/folder ``` ## Training a model ```bash python scripts/exp/train.py --args.load conf/vampnet.yml --save_path /path/to/checkpoints ``` See `python scripts/exp/train.py -h` for a list of options. ## Fine-tuning To fine-tune a model, use the script in `scripts/exp/fine_tune.py` to generate 3 configuration files: `c2f.yml`, `coarse.yml`, and `interface.yml`. The first two are used to fine-tune the coarse and fine models, respectively. The last one is used to launch the gradio interface. ```bash python scripts/exp/fine_tune.py "/path/to/audio1.mp3 /path/to/audio2/ /path/to/audio3.wav" ``` This will create a folder under `conf//` with the 3 configuration files. The save_paths will be set to `runs//coarse` and `runs//c2f`. launch the coarse job: ```bash python scripts/exp/train.py --args.load conf//coarse.yml ``` this will save the coarse model to `runs//coarse/ckpt/best/`. launch the c2f job: ```bash python scripts/exp/train.py --args.load conf//c2f.yml ``` launch the interface: ```bash python demo.py --args.load conf/generated//interface.yml ``` ## Launching the Gradio Interface ```bash python demo.py --args.load conf/interface/spotdl.yml --Interface.device cuda ```