Fix: Chat_template.jinja - Add Generation prompt

#40
by ilayt - opened

When using CLI agents like OpenCode, a bug in the chat template suppresses the <|turn>model header immediately following a tool execution. Without this header, the model fails to emit its native control tokens for reasoning. This causes the backend vLLM parser to miss the thought boundaries, leaking the raw reasoning text directly into the standard output. This fix updates the template to always append the model turn header after a tool response, ensuring special tokens trigger correctly.

Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment