Should I use CoT prompting in RL model as instruction-tuned model?

by tongyx361 - opened Feb 23

Feb 23

Example in the RL model card does not contain CoT prompting.

DeepSeek org Feb 25

Thanks for your question! We have updated the model card. We recommend using CoT prompting to obtain the best performance.

If you also want to activate tool-integrated reasoning, please check out the link below:
https://github.com/deepseek-ai/DeepSeek-Math/tree/main/evaluation

ZhihongShao changed discussion status to closed Feb 25

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment