Update chat_template to enable tool use

#13

by Jofthomas HF staff - opened Apr 18

base: refs/heads/main

←

from: refs/pr/13

Discussion Files changed

+10

-1

Update chat_template to enable tool useae1754b2

Jofthomas

Apr 18

This PR aims to align the tokenizer_config to allow the latest changes in HF tokenizer to be propagated here.
This is rough first version of the template based on my understanding of the way your tokenizer works ( append available tools right before the last user message)

What do you think ?
cc: @Rocketknight1 , @pstock , @sophiamyang , @osanseviero

Example use :

1) With tools :

from transformers import AutoTokenizer
model_id = "mistralai/Mixtral-8x22B-Instruct-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
conversation=[
    {"role": "user", "content": "What's the weather like in Paris?"},
    {
        "role": "tool_calls",
        "content": [
            {
                "name": "get_current_weather",
                "arguments": {"location": "Paris, France", "format": "celsius"},
                "id": "VvvODy9mT"
            }
        ]
    },
    {
        "role": "tools_results",
        "content": {"call_id": "VvvODy9mT", "content": 22}
    },
    {"role": "assistant", "content": "The current temperature in Paris, France is 22 degrees Celsius."},
    {"role": "user", "content": "What about San Francisco?"}
]


tools = [{"type": "function", "function": {"name":"get_current_weather", "description": "Get_the_current_weather", "parameters": {"type": "object", "properties": {"location": {"type": "string", "description": "The city and state, e.g. San Francisco, CA"}, "format": {"type": "string", "enum": ["celsius", "fahrenheit"], "description": "The temperature unit to use. Infer this from the users location."}},"required":["location","format"]}}}]

# render the tool use prompt as a string:
tool_use_prompt = tokenizer.apply_chat_template(
            conversation,
            chat_template="tool_use",
            tools=tools,
            tokenize=False,
            add_generation_prompt=True,

)
print(tool_use_prompt)

Which renders :

<s>[INST]What's the weather like in Paris?[/INST][TOOL_CALLS][{'name': 'get_current_weather', 'arguments': {'location': 'Paris, France', 'format': 'celsius'}, 'id': 'VvvODy9mT'}]</s>[/TOOL_CALLS] The current temperature in Paris, France is 22 degrees Celsius. </s>[AVAILABLE_TOOLS]{'type': 'function', 'function': {'name': 'get_current_weather', 'description': 'Get_the_current_weather', 'parameters': {'type': 'object', 'properties': {'location': {'type': 'string', 'description': 'The city and state, e.g. San Francisco, CA'}, 'format': {'type': 'string', 'enum': ['celsius', 'fahrenheit'], 'description': 'The temperature unit to use. Infer this from the users location.'}}, 'required': ['location', 'format']}}}[/AVAILABLE_TOOLS][INST]What about San Francisco?[/INST]

2) Without tools:

from transformers import AutoTokenizer

# Initialize model ID and tokenizer
model_id = "mistralai/Mixtral-8x22B-Instruct-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Define a conversation example
conversation = [
    {"role": "user", "content": "What's the weather like in Paris?"},
    {"role": "assistant", "content": "The current temperature in Paris, France is 22 degrees Celsius."},
    {"role": "user", "content": "What about San Francisco?"}
]

# Generate the tool use prompt using the tokenizer
tool_use_prompt = tokenizer.apply_chat_template(
    conversation,
    tokenize=False,
    add_generation_prompt=True
)

# Print the generated prompt
print(tool_use_prompt)

Which renders :

<s> [INST] What's the weather like in Paris? [/INST] The current temperature in Paris, France is 22 degrees Celsius. </s> [INST] What about San Francisco? [/INST]

sophiamyang

Mistral AI_ org Apr 18

Thanks so much for the PR! The format with tools is a little off. We don't have [/TOOL_CALLS].

Update tokenizer_config.json1a562c50

Update tokenizer_config.jsonb370631f

Update tokenizer_config.json94ee3fd0

Jofthomas

Apr 18

Thanks @sophiamyang , it seems I "halucinated" that token.

Should be better now.

sophiamyang changed pull request status to merged Apr 19

sophiamyang

Mistral AI_ org Apr 19

Thanks! merged.

sophiamyang

Mistral AI_ org Apr 19

@Jofthomas I'm a little confused why it didn't seem to work after I merge. Does it need time to make changes when I run the code?

Jofthomas

Apr 19

•

edited Apr 19

Doesn't this code work for you ? @sophiamyang

It does for me ( i just tested it ) :

Tools use :

from transformers import AutoTokenizer
model_id = "mistralai/Mixtral-8x22B-Instruct-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
conversation=[
    {"role": "user", "content": "What's the weather like in Paris?"},
    {
        "role": "tool_calls",
        "content": [
            {
                "name": "get_current_weather",
                "arguments": {"location": "Paris, France", "format": "celsius"},
                "id": "VvvODy9mT"
            }
        ]
    },
    {
        "role": "tool_results",
        "content": {"call_id": "VvvODy9mT", "content": 22}
    },
    {"role": "assistant", "content": "The current temperature in Paris, France is 22 degrees Celsius."},
    {"role": "user", "content": "What about San Francisco?"}
]


tools = [{"type": "function", "function": {"name":"get_current_weather", "description": "Get▁the▁current▁weather", "parameters": {"type": "object", "properties": {"location": {"type": "string", "description": "The city and state, e.g. San Francisco, CA"}, "format": {"type": "string", "enum": ["celsius", "fahrenheit"], "description": "The temperature unit to use. Infer this from the users location."}},"required":["location","format"]}}}]

# render the tool use prompt as a string:
tool_use_prompt = tokenizer.apply_chat_template(
            conversation,
            chat_template="tool_use",
            tools=tools,
            tokenize=False,
            add_generation_prompt=True,

)
print(tool_use_prompt)

No tools :


from transformers import AutoTokenizer

# Initialize model ID and tokenizer
model_id = "mistralai/Mixtral-8x22B-Instruct-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Define a conversation example
conversation = [
    {"role": "user", "content": "What's the weather like in Paris?"},
    {"role": "assistant", "content": "The current temperature in Paris, France is 22 degrees Celsius."},
    {"role": "user", "content": "What about San Francisco?"}
]

# Generate the tool use prompt using the tokenizer
tool_use_prompt = tokenizer.apply_chat_template(
    conversation,
    tokenize=False,
    add_generation_prompt=True
)

# Print the generated prompt
print(tool_use_prompt)

sophiamyang

Mistral AI_ org Apr 19

Copied and pasted your code. Just ran it again. I'm not sure why it doesn't work for me @Jofthomas

Jofthomas

Apr 19

•

edited Apr 19

You should upgrade transformers as it is part of the latest change :)

pip install transformers==4.40.0

This works for version >= 4.39.0

sophiamyang

Mistral AI_ org Apr 19

Thanks @Jofthomas ! That works great. One more thing: it should be a list after [AVAILABLE_TOOLS]

Jofthomas

Apr 19

ohh I see, let me update the template in a seperate PR, and I'll also update the Readme so that users can have an example of how to use it and do not fall into the same version problem.

Does it sounds good ?

sophiamyang

Mistral AI_ org Apr 19

great thanks!

strangerv

Apr 20

have folks had success getting back a valid function calling response from the model with this template?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment