Highly Responsive to Prompts - Touhou MusicGen (Medium) Finetune Notes

11/13 Research Notes:

  1. Some of the generated samples don't sound Touhou-like; they drift toward conventional upbeat, cheerful music, like elevator or corporate filler music. A few potential causes:
    a. The Essentia-generated audio tags haven't been checked carefully against the songs they describe. I remember seeing some wackiness in there that we could stand to clear out.
    b. I'm not sure, but I think I heard some key changes in the samples. Should check the "key" items in the jsonl as well.
  2. For our next run, I'd like to add LP-MusicCaps support: https://huggingface.co/spaces/seungheondoh/LP-Music-Caps-demo
  3. I'm not sure that downsampling to 32 kHz is actually necessary; I've seen other people skip it.
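
To make the tag and key review in item 1 less of a manual slog, here is a minimal sketch that tallies the values of a few metadata fields across a jsonl file so odd entries stand out. Only the "key" field is mentioned in these notes; the "moods" and "genre" field names, and the helper names `load_records`, `tally_field`, and `report`, are assumptions for illustration and should be swapped for whatever the dataset actually uses.

```python
import json
from collections import Counter
from pathlib import Path


def load_records(jsonl_path):
    """Read one JSON object per non-empty line of a jsonl file."""
    records = []
    with Path(jsonl_path).open(encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                records.append(json.loads(line))
    return records


def tally_field(records, field):
    """Count the values of `field` across records.

    List values (e.g. a list of tags) are counted per item.
    Absent, empty-string, or empty-list values are counted as "<missing>".
    """
    counts = Counter()
    for record in records:
        value = record.get(field)
        if value is None or value == "" or value == []:
            counts["<missing>"] += 1
        elif isinstance(value, list):
            counts.update(str(item) for item in value)
        else:
            counts[str(value)] += 1
    return counts


def report(jsonl_path, fields=("key", "moods", "genre")):
    """Print a frequency table per field; the field names are assumed, not confirmed."""
    records = load_records(jsonl_path)
    for field in fields:
        print(f"{field}:")
        for value, count in tally_field(records, field).most_common():
            print(f"  {count:4d}  {value}")
```

Calling `report` on the training jsonl prints one frequency table per field. Rare or surprising values (a key that appears once, a mood tag that clearly doesn't fit the source material) are the ones worth listening to and checking by hand.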