Update tokenizer_config.json

#3
by erichartford - opened

Use official chat template, that inserts a <think>

can you please go back to DeepSeek's official chat template that includes <think> to force it to start with a <think> otherwise it often skips thinking, or mixes thinking in with the response

https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B/blob/main/tokenizer_config.json#L34

Neural Magic org

Thanks @erichartford for pointing this out. Did DeepSeek change the tokenizer? Just want to make sure we understand why it diverged

nm-research changed pull request status to merged
Neural Magic org

@erichartford Great catch, thanks a lot for pointing it out! Indeed, DeepSeek updated tokenizer_config.json with this diff 14 days ago, and we forked their model for quantization on the initial release day; therefore, we haven't picked it up. We will update all of our quantized models.

Sign up or log in to comment