How to pass "history_prompt" in the API inference?

#10
by vivasvan100 - opened

Hi there,

how can I pass the "v2/en_speaker_6" history prompt to the model?

Hey @vivasvan100 , you can do it like this:

from transformers import AutoProcessor, BarkModel

processor = AutoProcessor.from_pretrained("suno/bark")
model = BarkModel.from_pretrained("suno/bark")

voice_preset = "v2/en_speaker_6"

inputs = processor("Hello, my dog is cute", voice_preset=voice_preset)

audio_array = model.generate(**inputs)
audio_array = audio_array.cpu().numpy().squeeze()
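
If it helps, the resulting array can then be written to a WAV file; a minimal sketch using scipy, with a synthetic sine wave standing in for the model output and Bark's default 24 kHz sample rate:

```python
import numpy as np
from scipy.io import wavfile

# Bark generates audio at 24 kHz by default (model.generation_config.sample_rate)
sample_rate = 24_000

# synthetic one-second 440 Hz tone standing in for the array from model.generate()
t = np.arange(sample_rate) / sample_rate
audio_array = np.sin(2 * np.pi * 440 * t).astype(np.float32)

# write the waveform to disk as a playable WAV file
wavfile.write("bark_out.wav", rate=sample_rate, data=audio_array)
```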

Thank you for your response; however, I would like to do it via the 🤗 Inference API or a 🤗 Inference Endpoint.

The code above runs on my local system, I believe. Is that correct?

My question is:

import requests

API_URL = "https://api-inference.huggingface.co/models/suno/bark-small"
headers = {"Authorization": "Bearer xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx"}

def query(payload):
    response = requests.post(API_URL, headers=headers, json=payload)
    return response.json()
    
output = query({
    "inputs": "The answer to the universe is 42",
    # If possible .... can I add a parameter here for history prompt
})

Sorry if this is a noob question, I am new to Hugging Face.

I believe you can first define some forward parameters:

from transformers import AutoProcessor

processor = AutoProcessor.from_pretrained("suno/bark")

# run the processor once just to obtain the voice preset tensors
blank_inp = processor("blank", voice_preset="v2/en_speaker_1")
# extract the history prompt for the chosen preset
history_prompt = blank_inp["history_prompt"]
forward_params = {
    "history_prompt": history_prompt,
}

And then pass this to your request:

import requests

API_URL = "https://api-inference.huggingface.co/models/suno/bark-small"
headers = {"Authorization": "Bearer xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx"}

def query(payload):
    response = requests.post(API_URL, headers=headers, json=payload)
    return response.json()
    
output = query({
    "inputs": "The answer to the universe is 42",
    "forward_params": forward_params,
})

I'm actually not sure this works; could you let me know what you get?

It does not work because the output of the processor contains tensors that are not JSON serialisable.
I tried this fix:

blank_inp = AutoProcessor.from_pretrained("suno/bark")(
    "blank", voice_preset="v2/en_speaker_1"
)
# extract the history prompt and convert each tensor to a plain Python list
history_prompt = blank_inp["history_prompt"]
new_history_prompt = {
    key: value.cpu().numpy().tolist()
    for key, value in history_prompt.items()
}
forward_params = {
    "history_prompt": new_history_prompt,
}

This runs through, but the speaker is actually not consistent... Any ideas?
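
The serialisation problem mentioned above can be reproduced without the model at all; a small self-contained sketch, with a numpy array standing in for one of the history-prompt tensors:

```python
import json
import numpy as np

# stand-in for one entry of Bark's history_prompt (normally a tensor / numpy array)
semantic_prompt = np.arange(5)

# raw arrays cannot go straight into a JSON payload...
try:
    json.dumps({"history_prompt": {"semantic_prompt": semantic_prompt}})
except TypeError as err:
    print(f"not serialisable: {err}")

# ...but converting them to plain Python lists works
payload = json.dumps({"history_prompt": {"semantic_prompt": semantic_prompt.tolist()}})
print(payload)
```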

FYI I managed to make it work using a custom handler. Feel free to use https://huggingface.co/skroed/bark-small
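
For anyone curious, an Inference Endpoints custom handler lives in a handler.py file that exposes an EndpointHandler class. A minimal sketch of what such a handler might look like for Bark — note the "voice_preset" parameter name and the returned dict shape are my assumptions here, not necessarily what skroed/bark-small actually does:

```python
from typing import Any, Dict


class EndpointHandler:
    """Custom handler (handler.py) for a Hugging Face Inference Endpoint."""

    def __init__(self, path: str = ""):
        # heavy imports kept inside __init__ so the module parses without them
        from transformers import AutoProcessor, BarkModel

        # `path` is the local model directory the endpoint passes in
        self.processor = AutoProcessor.from_pretrained(path)
        self.model = BarkModel.from_pretrained(path)

    def __call__(self, data: Dict[str, Any]) -> Dict[str, Any]:
        text = data["inputs"]
        # hypothetical parameter name for choosing the speaker preset
        voice_preset = data.get("parameters", {}).get("voice_preset", "v2/en_speaker_6")
        inputs = self.processor(text, voice_preset=voice_preset)
        audio = self.model.generate(**inputs).cpu().numpy().squeeze()
        # return plain lists so the response is JSON serialisable
        return {"audio": audio.tolist(), "sample_rate": 24_000}
```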

Wow, thanks for figuring that out! Leaving the discussion open for future reference!
