
Update Nemotron vLLM tool parser examples

#37
by dongwng - opened

Align Nemotron Nano v2 tool parser with vLLM

A follow-up to the vLLM integration PR: https://github.com/vllm-project/vllm/pull/42255
Rename the shipped vLLM tool parser registration from nemotron_json to nemotron_nano_v2, update the README serve command, and keep the streaming plugin as the documented default. Also escape rendered tool-call names in the embedded chat template with the tojson filter, so that tool calls replayed from conversation history remain valid JSON.
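To illustrate why the tool-call name needs escaping, here is a minimal sketch in plain Python. It does not reproduce the actual chat template: json.dumps stands in for Jinja's tojson filter, and the helper names, the rendered layout, and the sample tool name are all hypothetical.

```python
import json


def render_naive(name: str, arguments: dict) -> str:
    # Hypothetical: paste the name between quotes with no escaping,
    # as a template without tojson would.
    return '{"name": "' + name + '", "arguments": ' + json.dumps(arguments) + "}"


def render_escaped(name: str, arguments: dict) -> str:
    # json.dumps stands in for the tojson filter: it emits the surrounding
    # quotes and escapes any quotes or backslashes inside the name.
    return '{"name": ' + json.dumps(name) + ', "arguments": ' + json.dumps(arguments) + "}"


def is_valid_json(text: str) -> bool:
    # True when the rendered tool call can be parsed back as JSON.
    try:
        json.loads(text)
        return True
    except json.JSONDecodeError:
        return False


# A made-up tool name containing double quotes, chosen to break naive rendering.
tricky_name = 'get "weather"'
args = {"city": "Paris"}

naive = render_naive(tricky_name, args)
escaped = render_escaped(tricky_name, args)

print(is_valid_json(naive))
print(is_valid_json(escaped))
```

With an ordinary name such as get_weather both renderings parse; the difference only shows up when the name contains characters that are special inside a JSON string.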

dongwng changed pull request status to open
