Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
mrfakename 
posted an update about 1 month ago
Post
962
Hi,
I'm looking for an open-sourced (permissively-licensed) audio/music captioning model.
Does anyone have any suggestions?
Thanks!

Hi,
You can take a look at this
CLAP (Contrastive Language-Audio Pretraining)

·

Thanks! I’ll take a look into CLAP - do you know if it’s possible to generate captions though?

You can try https://huggingface.co/tsinghua-ee/SALMONN-7B, which is quite impressive, apache licensed ;)

·

Thanks! Do you know if this is based on LLaMA V1 or Llama 2? (Curious about licensing)