Worse tool-calling accuracy due to chat_template.jinja

#2
by Neiko2002 - opened

Where does the chat_template.jinja file come from? It performs worse than the original template on the Toolcall-15 benchmark at https://benchlocal.com/ and on several other packs hosted there. It uses approximately 25% fewer tokens, but the regression in tool-calling accuracy is significant.

Webhie org

It was from here: https://huggingface.co/froggeric/Qwen-Fixed-Chat-Templates
I can get that updated. Tool calls have been reliable in my own harness, but I don't doubt the problems you're seeing. Thanks for pointing this out.
