migtissera
/

Tess-R1-Limerick-Llama-3.1-70B

Model card Files Files and versions Community

migtissera commited on 12 days ago

Commit

4e110f2

•

1 Parent(s): 8511385

Update README.md

Files changed (1) hide show

README.md +4 -1

README.md CHANGED Viewed

@@ -38,7 +38,10 @@ The system message *must* be the following:
 # Inference
-The model was trained mostly with Chain-of-Thought reasoning data, including the XML tags. However, to generalize model generations, some single-turn and multi-turn data without XML tags were also included. Due to this, in some instances the model does not produce XML tags and does not fully utilize test-time compute capabilities. Therefore you should include a try statement in your inference script, and only pass on the contents between the `<output>` `</output>` tags if it's available.
 I have included a sample Python script below.

 # Inference
+The model was trained mostly with Chain-of-Thought reasoning data, including the XML tags. However, to generalize model generations, some single-turn and multi-turn data without XML tags were also included. Due to this, in some instances the model does not produce XML tags and does not fully utilize test-time compute capabilities. There is two ways to get around this:
+- Include a try/catch statement in your inference script, and only pass on the contents between the `<output>` `</output>` tags if it's available.
+- Use the `<thinking>` tag as the seed in the generation. i.e: `f"{conversation}{user_input}<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n<thinking>"`
 I have included a sample Python script below.