---
title: Audio Palette
emoji: 🎧
colorFrom: purple
colorTo: indigo
sdk: gradio
sdk_version: 4.36.1
app_file: app.py
pinned: false
license: mit
models:
- onlycaps/pace_model_weights
- Salesforce/blip-image-captioning-large
- facebook/musicgen-small
tags:
- image2music-generation
- image-captioning
---

# Audio Palette

### Usage

- This Space runs on CPU, so it cannot generate music in a reasonable time.
- As a workaround, we provide a [Python notebook](./notebooks/AudioPalette.ipynb) that handles the music generation step. It can be run locally (if you have a GPU) or elsewhere.
- The notebook uses FastAPI to accept API requests and ngrok to expose the server. Paste the resulting ngrok link into the input box, making sure to include the trailing `/`.
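
The trailing `/` most likely matters because an endpoint path is appended to the pasted link; that is an assumption, not something this README confirms. The sketch below shows one way a client could normalize the link before building a request URL. The helper names and the `generate` path are hypothetical and are not taken from `app.py` or the notebook.

```python
def normalize_server_url(url: str) -> str:
    """Return the URL without surrounding whitespace, ending in exactly one '/'."""
    url = url.strip()
    if not url:
        raise ValueError("server URL is empty")
    if not url.startswith(("http://", "https://")):
        raise ValueError("server URL must start with http:// or https://")
    # Drop any trailing slashes, then add a single one back.
    return url.rstrip("/") + "/"


def endpoint_url(server_url: str, path: str) -> str:
    """Join the normalized server URL with an endpoint path.

    The path is a hypothetical example; the real route names
    are defined by the notebook's server.
    """
    return normalize_server_url(server_url) + path.lstrip("/")
```

With a helper like this, `https://example.ngrok.io` and `https://example.ngrok.io/` resolve to the same request URL, whereas plain string concatenation without the trailing `/` would produce a malformed address.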