How to run you model with opencode with tool calling?

#4
by dfsafdsf - opened

CUDA_VISIBLE_DEVICES=1 ../bin/llama-server -m MiniCPM5-1B-Q4_K_M.gguf -c 30003 -ngl 99 --cont-batching --host 0.0.0.0 --port 1113 -fa on -b 2048 -ub 1024 -t 8 -tb 8 --no-mmap --kv-unified --alias minicpm5 --jinja

image

dfsafdsf changed discussion title from How to run you model with opencode? to How to run you model with opencode with tool calling?
OpenBMB org

Hi @dfsafdsf
We have a PR that adds tool call support to llama.cpp, but it has not been merged upstream yet. If you need this functionality right away, you can temporarily apply the patch from the PR below.
https://github.com/ggml-org/llama.cpp/pull/23802

Hi @dfsafdsf
We have a PR that adds tool call support to llama.cpp, but it has not been merged upstream yet. If you need this functionality right away, you can temporarily apply the patch from the PR below.
https://github.com/ggml-org/llama.cpp/pull/23802

it's not working (screenshots in pull request )

Sign up or log in to comment