A masterpiece

#11
by pgib2003 - opened

But can it be extended with tool calling etc. for agent use? I can hand off a task to the model as a sideman but would be nice to tackle certain tasks fully?

WeiboAI org

Currently, VibeThinker-3B does not officially support tool calling or full agent workflows.But since the model is small, it should be relatively cheap to experiment with LoRA / SFT for function-calling formats, tool-selection, or agent traces. We haven’t optimized or evaluated that yet, but we’d be very interested to see community attempts.

Ok, thanks. I'm trying SFT on RTX 3090 with this dataset: https://huggingface.co/datasets/interstellarninja/hermes_reasoning_tool_use

Some of the thinking blocks were too large and the trained model was prone to overthinking, will try again ☹️

Sign up or log in to comment