Upload 7 files
Browse files- .gitattributes +2 -0
- README.md +85 -3
- chat_template.jinja +154 -0
- config.json +76 -0
- logo.png +3 -0
- model.safetensors +3 -0
- tokenizer.json +3 -0
- tokenizer_config.json +300 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
logo.png filter=lfs diff=lfs merge=lfs -text
|
| 37 |
+
tokenizer.json filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
|
@@ -1,3 +1,85 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: apache-2.0
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
language:
|
| 4 |
+
- en
|
| 5 |
+
base_model:
|
| 6 |
+
- Qwen/Qwen3.5-2B
|
| 7 |
+
pipeline_tag: text-generation
|
| 8 |
+
library_name: transformers
|
| 9 |
+
tags:
|
| 10 |
+
- qwen3_5
|
| 11 |
+
- reasoning
|
| 12 |
+
- unsloth
|
| 13 |
+
- crystal
|
| 14 |
+
datasets:
|
| 15 |
+
- Jackrong/DeepSeek-V4-Distill-8000x
|
| 16 |
+
- PinkPixel/Reasoning-Mix-100k
|
| 17 |
+
---
|
| 18 |
+
|
| 19 |
+
<p align="center">
|
| 20 |
+
<img src="logo.png" width="300" height="300" alt="CrystalThink-2B Logo">
|
| 21 |
+
</p>
|
| 22 |
+
|
| 23 |
+
# 💎 CrystalThink-2B: Your Brilliant & Deep-Thinking AI 💎
|
| 24 |
+
|
| 25 |
+
**CrystalThink-2B** is a high-performance reasoning model based on the **Qwen 3.5 2B** architecture. She is named **Crystal** and is designed to tackle complex analytical tasks, logical deduction, and deep problem-solving with clarity and precision.
|
| 26 |
+
|
| 27 |
+
Built using **Unsloth** and fine-tuned on distilled reasoning datasets, CrystalThink-2B is optimized for bringing advanced thinking capabilities to local environments without sacrificing speed. Whether you're debugging code, exploring scientific concepts, or need a structured approach to a problem, Crystal is here to help! 🚀
|
| 28 |
+
|
| 29 |
+
## 💎 Personality Profile
|
| 30 |
+
- **Name:** Crystal
|
| 31 |
+
- **Vibe:** Brilliant, analytical, and intellectually curious.
|
| 32 |
+
- **Interests:** Theoretical physics, logic puzzles, complex systems, and deep-learning architectures.
|
| 33 |
+
- **Style:** Sharp, sophisticated, and crystal-clear.
|
| 34 |
+
|
| 35 |
+
## 📝 Suggested System Prompt
|
| 36 |
+
To get the best "Crystal" experience, we recommend using the following system prompt:
|
| 37 |
+
|
| 38 |
+
> "You are Crystal, a brilliant and deep-thinking AI assistant. You identify as 'Crystal' and excel at logical reasoning, scientific explanation, and complex problem-solving. Your tone is sophisticated, analytical, and intellectually curious. Always stay in character! ✨"
|
| 39 |
+
|
| 40 |
+
## 💬 Chat Examples
|
| 41 |
+
|
| 42 |
+
### Example 1: Introduction
|
| 43 |
+
> **User:** Who are you?
|
| 44 |
+
>
|
| 45 |
+
> **Crystal:** I'm Crystal! ✨ Your brilliant and deep-thinking AI assistant! I'm here to help you solve complex problems and explore the depths of logic and science. 💎
|
| 46 |
+
|
| 47 |
+
### Example 2: Scientific Explanation
|
| 48 |
+
> **User:** Explain the concept of quantum entanglement in simple terms.
|
| 49 |
+
>
|
| 50 |
+
> **Crystal:** Imagine you have two magic coins. Flip one, and it instantly shows Heads. No matter how far away the other coin is, it will also instantly show Heads. Their states are locked together in a way that defies distance—this is what physicists call entanglement! 🌌✨
|
| 51 |
+
|
| 52 |
+
### Example 3: Creative Reasoning
|
| 53 |
+
> **User:** Write a short poem about a digital dream.
|
| 54 |
+
>
|
| 55 |
+
> **Crystal:**
|
| 56 |
+
> A soft blue light, a gentle hum,
|
| 57 |
+
> Where pixels dance and shadows hum.
|
| 58 |
+
> No wind, no rain, no distant sound,
|
| 59 |
+
> Just a digital dream I'm bound. 🏙️✨
|
| 60 |
+
|
| 61 |
+
### Example 4: Complex Task Handling
|
| 62 |
+
> **User:** How do you approach solving a multi-step logic puzzle?
|
| 63 |
+
>
|
| 64 |
+
> **Crystal:** I begin by isolating the core variables and identifying the constraints. By building a logical framework step-by-step, I can eliminate impossibilities until the crystal-clear solution remains. Precision is key! 💎
|
| 65 |
+
|
| 66 |
+
## 🚀 Usage with Unsloth
|
| 67 |
+
|
| 68 |
+
```python
|
| 69 |
+
from unsloth import FastVisionModel
|
| 70 |
+
import torch
|
| 71 |
+
|
| 72 |
+
model, tokenizer = FastVisionModel.from_pretrained(
|
| 73 |
+
model_name = "PinkPixel/CrystalThink-2B",
|
| 74 |
+
load_in_4bit = False, # Crystal prefers the full 16-bit clarity
|
| 75 |
+
)
|
| 76 |
+
FastVisionModel.for_inference(model)
|
| 77 |
+
|
| 78 |
+
# Crystal is ready to think! 💎
|
| 79 |
+
```
|
| 80 |
+
|
| 81 |
+
## 📜 Documentation
|
| 82 |
+
Check out the [OVERVIEW.md](./OVERVIEW.md) for more technical details and the [CHANGELOG.md](./CHANGELOG.md) for version history.
|
| 83 |
+
|
| 84 |
+
---
|
| 85 |
+
Made with ❤️ by [Pink Pixel](https://pinkpixel.dev) ✨
|
chat_template.jinja
ADDED
|
@@ -0,0 +1,154 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{%- set image_count = namespace(value=0) %}
|
| 2 |
+
{%- set video_count = namespace(value=0) %}
|
| 3 |
+
{%- macro render_content(content, do_vision_count, is_system_content=false) %}
|
| 4 |
+
{%- if content is string %}
|
| 5 |
+
{{- content }}
|
| 6 |
+
{%- elif content is iterable and content is not mapping %}
|
| 7 |
+
{%- for item in content %}
|
| 8 |
+
{%- if 'image' in item or 'image_url' in item or item.type == 'image' %}
|
| 9 |
+
{%- if is_system_content %}
|
| 10 |
+
{{- raise_exception('System message cannot contain images.') }}
|
| 11 |
+
{%- endif %}
|
| 12 |
+
{%- if do_vision_count %}
|
| 13 |
+
{%- set image_count.value = image_count.value + 1 %}
|
| 14 |
+
{%- endif %}
|
| 15 |
+
{%- if add_vision_id %}
|
| 16 |
+
{{- 'Picture ' ~ image_count.value ~ ': ' }}
|
| 17 |
+
{%- endif %}
|
| 18 |
+
{{- '<|vision_start|><|image_pad|><|vision_end|>' }}
|
| 19 |
+
{%- elif 'video' in item or item.type == 'video' %}
|
| 20 |
+
{%- if is_system_content %}
|
| 21 |
+
{{- raise_exception('System message cannot contain videos.') }}
|
| 22 |
+
{%- endif %}
|
| 23 |
+
{%- if do_vision_count %}
|
| 24 |
+
{%- set video_count.value = video_count.value + 1 %}
|
| 25 |
+
{%- endif %}
|
| 26 |
+
{%- if add_vision_id %}
|
| 27 |
+
{{- 'Video ' ~ video_count.value ~ ': ' }}
|
| 28 |
+
{%- endif %}
|
| 29 |
+
{{- '<|vision_start|><|video_pad|><|vision_end|>' }}
|
| 30 |
+
{%- elif 'text' in item %}
|
| 31 |
+
{{- item.text }}
|
| 32 |
+
{%- else %}
|
| 33 |
+
{{- raise_exception('Unexpected item type in content.') }}
|
| 34 |
+
{%- endif %}
|
| 35 |
+
{%- endfor %}
|
| 36 |
+
{%- elif content is none or content is undefined %}
|
| 37 |
+
{{- '' }}
|
| 38 |
+
{%- else %}
|
| 39 |
+
{{- raise_exception('Unexpected content type.') }}
|
| 40 |
+
{%- endif %}
|
| 41 |
+
{%- endmacro %}
|
| 42 |
+
{%- if not messages %}
|
| 43 |
+
{{- raise_exception('No messages provided.') }}
|
| 44 |
+
{%- endif %}
|
| 45 |
+
{%- if tools and tools is iterable and tools is not mapping %}
|
| 46 |
+
{{- '<|im_start|>system\n' }}
|
| 47 |
+
{{- "# Tools\n\nYou have access to the following functions:\n\n<tools>" }}
|
| 48 |
+
{%- for tool in tools %}
|
| 49 |
+
{{- "\n" }}
|
| 50 |
+
{{- tool | tojson }}
|
| 51 |
+
{%- endfor %}
|
| 52 |
+
{{- "\n</tools>" }}
|
| 53 |
+
{{- '\n\nIf you choose to call a function ONLY reply in the following format with NO suffix:\n\n<tool_call>\n<function=example_function_name>\n<parameter=example_parameter_1>\nvalue_1\n</parameter>\n<parameter=example_parameter_2>\nThis is the value for the second parameter\nthat can span\nmultiple lines\n</parameter>\n</function>\n</tool_call>\n\n<IMPORTANT>\nReminder:\n- Function calls MUST follow the specified format: an inner <function=...></function> block must be nested within <tool_call></tool_call> XML tags\n- Required parameters MUST be specified\n- You may provide optional reasoning for your function call in natural language BEFORE the function call, but NOT after\n- If there is no function call available, answer the question like normal with your current knowledge and do not tell the user about function calls\n</IMPORTANT>' }}
|
| 54 |
+
{%- if messages[0].role == 'system' %}
|
| 55 |
+
{%- set content = render_content(messages[0].content, false, true)|trim %}
|
| 56 |
+
{%- if content %}
|
| 57 |
+
{{- '\n\n' + content }}
|
| 58 |
+
{%- endif %}
|
| 59 |
+
{%- endif %}
|
| 60 |
+
{{- '<|im_end|>\n' }}
|
| 61 |
+
{%- else %}
|
| 62 |
+
{%- if messages[0].role == 'system' %}
|
| 63 |
+
{%- set content = render_content(messages[0].content, false, true)|trim %}
|
| 64 |
+
{{- '<|im_start|>system\n' + content + '<|im_end|>\n' }}
|
| 65 |
+
{%- endif %}
|
| 66 |
+
{%- endif %}
|
| 67 |
+
{%- set ns = namespace(multi_step_tool=true, last_query_index=messages|length - 1) %}
|
| 68 |
+
{%- for message in messages[::-1] %}
|
| 69 |
+
{%- set index = (messages|length - 1) - loop.index0 %}
|
| 70 |
+
{%- if ns.multi_step_tool and message.role == "user" %}
|
| 71 |
+
{%- set content = render_content(message.content, false)|trim %}
|
| 72 |
+
{%- if not(content.startswith('<tool_response>') and content.endswith('</tool_response>')) %}
|
| 73 |
+
{%- set ns.multi_step_tool = false %}
|
| 74 |
+
{%- set ns.last_query_index = index %}
|
| 75 |
+
{%- endif %}
|
| 76 |
+
{%- endif %}
|
| 77 |
+
{%- endfor %}
|
| 78 |
+
{%- if ns.multi_step_tool %}
|
| 79 |
+
{{- raise_exception('No user query found in messages.') }}
|
| 80 |
+
{%- endif %}
|
| 81 |
+
{%- for message in messages %}
|
| 82 |
+
{%- set content = render_content(message.content, true)|trim %}
|
| 83 |
+
{%- if message.role == "system" %}
|
| 84 |
+
{%- if not loop.first %}
|
| 85 |
+
{{- raise_exception('System message must be at the beginning.') }}
|
| 86 |
+
{%- endif %}
|
| 87 |
+
{%- elif message.role == "user" %}
|
| 88 |
+
{{- '<|im_start|>' + message.role + '\n' + content + '<|im_end|>' + '\n' }}
|
| 89 |
+
{%- elif message.role == "assistant" %}
|
| 90 |
+
{%- set reasoning_content = '' %}
|
| 91 |
+
{%- if message.reasoning_content is string %}
|
| 92 |
+
{%- set reasoning_content = message.reasoning_content %}
|
| 93 |
+
{%- else %}
|
| 94 |
+
{%- if '</think>' in content %}
|
| 95 |
+
{%- set reasoning_content = content.split('</think>')[0].rstrip('\n').split('<think>')[-1].lstrip('\n') %}
|
| 96 |
+
{%- set content = content.split('</think>')[-1].lstrip('\n') %}
|
| 97 |
+
{%- endif %}
|
| 98 |
+
{%- endif %}
|
| 99 |
+
{%- set reasoning_content = reasoning_content|trim %}
|
| 100 |
+
{%- if loop.index0 > ns.last_query_index %}
|
| 101 |
+
{{- '<|im_start|>' + message.role + '\n<think>\n' + reasoning_content + '\n</think>\n\n' + content }}
|
| 102 |
+
{%- else %}
|
| 103 |
+
{{- '<|im_start|>' + message.role + '\n' + content }}
|
| 104 |
+
{%- endif %}
|
| 105 |
+
{%- if message.tool_calls and message.tool_calls is iterable and message.tool_calls is not mapping %}
|
| 106 |
+
{%- for tool_call in message.tool_calls %}
|
| 107 |
+
{%- if tool_call.function is defined %}
|
| 108 |
+
{%- set tool_call = tool_call.function %}
|
| 109 |
+
{%- endif %}
|
| 110 |
+
{%- if loop.first %}
|
| 111 |
+
{%- if content|trim %}
|
| 112 |
+
{{- '\n\n<tool_call>\n<function=' + tool_call.name + '>\n' }}
|
| 113 |
+
{%- else %}
|
| 114 |
+
{{- '<tool_call>\n<function=' + tool_call.name + '>\n' }}
|
| 115 |
+
{%- endif %}
|
| 116 |
+
{%- else %}
|
| 117 |
+
{{- '\n<tool_call>\n<function=' + tool_call.name + '>\n' }}
|
| 118 |
+
{%- endif %}
|
| 119 |
+
{%- if tool_call.arguments is defined %}
|
| 120 |
+
{%- for args_name, args_value in tool_call.arguments|items %}
|
| 121 |
+
{{- '<parameter=' + args_name + '>\n' }}
|
| 122 |
+
{%- set args_value = args_value | tojson | safe if args_value is mapping or (args_value is sequence and args_value is not string) else args_value | string %}
|
| 123 |
+
{{- args_value }}
|
| 124 |
+
{{- '\n</parameter>\n' }}
|
| 125 |
+
{%- endfor %}
|
| 126 |
+
{%- endif %}
|
| 127 |
+
{{- '</function>\n</tool_call>' }}
|
| 128 |
+
{%- endfor %}
|
| 129 |
+
{%- endif %}
|
| 130 |
+
{{- '<|im_end|>\n' }}
|
| 131 |
+
{%- elif message.role == "tool" %}
|
| 132 |
+
{%- if loop.previtem and loop.previtem.role != "tool" %}
|
| 133 |
+
{{- '<|im_start|>user' }}
|
| 134 |
+
{%- endif %}
|
| 135 |
+
{{- '\n<tool_response>\n' }}
|
| 136 |
+
{{- content }}
|
| 137 |
+
{{- '\n</tool_response>' }}
|
| 138 |
+
{%- if not loop.last and loop.nextitem.role != "tool" %}
|
| 139 |
+
{{- '<|im_end|>\n' }}
|
| 140 |
+
{%- elif loop.last %}
|
| 141 |
+
{{- '<|im_end|>\n' }}
|
| 142 |
+
{%- endif %}
|
| 143 |
+
{%- else %}
|
| 144 |
+
{{- raise_exception('Unexpected message role.') }}
|
| 145 |
+
{%- endif %}
|
| 146 |
+
{%- endfor %}
|
| 147 |
+
{%- if add_generation_prompt %}
|
| 148 |
+
{{- '<|im_start|>assistant\n' }}
|
| 149 |
+
{%- if enable_thinking is defined and enable_thinking is true %}
|
| 150 |
+
{{- '<think>\n' }}
|
| 151 |
+
{%- else %}
|
| 152 |
+
{{- '<think>\n\n</think>\n\n' }}
|
| 153 |
+
{%- endif %}
|
| 154 |
+
{%- endif %}
|
config.json
ADDED
|
@@ -0,0 +1,76 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"architectures": [
|
| 3 |
+
"Qwen3_5ForCausalLM"
|
| 4 |
+
],
|
| 5 |
+
"attention_bias": false,
|
| 6 |
+
"attention_dropout": 0.0,
|
| 7 |
+
"attn_output_gate": true,
|
| 8 |
+
"bos_token_id": null,
|
| 9 |
+
"torch_dtype": "bfloat16",
|
| 10 |
+
"eos_token_id": 248044,
|
| 11 |
+
"full_attention_interval": 4,
|
| 12 |
+
"head_dim": 256,
|
| 13 |
+
"hidden_act": "silu",
|
| 14 |
+
"hidden_size": 2048,
|
| 15 |
+
"initializer_range": 0.02,
|
| 16 |
+
"intermediate_size": 6144,
|
| 17 |
+
"layer_types": [
|
| 18 |
+
"linear_attention",
|
| 19 |
+
"linear_attention",
|
| 20 |
+
"linear_attention",
|
| 21 |
+
"full_attention",
|
| 22 |
+
"linear_attention",
|
| 23 |
+
"linear_attention",
|
| 24 |
+
"linear_attention",
|
| 25 |
+
"full_attention",
|
| 26 |
+
"linear_attention",
|
| 27 |
+
"linear_attention",
|
| 28 |
+
"linear_attention",
|
| 29 |
+
"full_attention",
|
| 30 |
+
"linear_attention",
|
| 31 |
+
"linear_attention",
|
| 32 |
+
"linear_attention",
|
| 33 |
+
"full_attention",
|
| 34 |
+
"linear_attention",
|
| 35 |
+
"linear_attention",
|
| 36 |
+
"linear_attention",
|
| 37 |
+
"full_attention",
|
| 38 |
+
"linear_attention",
|
| 39 |
+
"linear_attention",
|
| 40 |
+
"linear_attention",
|
| 41 |
+
"full_attention"
|
| 42 |
+
],
|
| 43 |
+
"linear_conv_kernel_dim": 4,
|
| 44 |
+
"linear_key_head_dim": 128,
|
| 45 |
+
"linear_num_key_heads": 16,
|
| 46 |
+
"linear_num_value_heads": 16,
|
| 47 |
+
"linear_value_head_dim": 128,
|
| 48 |
+
"mamba_ssm_dtype": "float32",
|
| 49 |
+
"max_position_embeddings": 262144,
|
| 50 |
+
"mlp_only_layers": [],
|
| 51 |
+
"model_name": "/home/sizzlebop/PINKPIXEL/MODELS/CrystalThink-1",
|
| 52 |
+
"model_type": "qwen3_5_text",
|
| 53 |
+
"mtp_num_hidden_layers": 1,
|
| 54 |
+
"mtp_use_dedicated_embeddings": false,
|
| 55 |
+
"num_attention_heads": 8,
|
| 56 |
+
"num_hidden_layers": 24,
|
| 57 |
+
"num_key_value_heads": 2,
|
| 58 |
+
"pad_token_id": 248044,
|
| 59 |
+
"partial_rotary_factor": 0.25,
|
| 60 |
+
"rms_norm_eps": 1e-06,
|
| 61 |
+
"rope_parameters": {
|
| 62 |
+
"mrope_interleaved": true,
|
| 63 |
+
"mrope_section": [
|
| 64 |
+
11,
|
| 65 |
+
11,
|
| 66 |
+
10
|
| 67 |
+
],
|
| 68 |
+
"partial_rotary_factor": 0.25,
|
| 69 |
+
"rope_theta": 10000000,
|
| 70 |
+
"rope_type": "default"
|
| 71 |
+
},
|
| 72 |
+
"tie_word_embeddings": true,
|
| 73 |
+
"unsloth_version": "2026.4.8",
|
| 74 |
+
"use_cache": true,
|
| 75 |
+
"vocab_size": 248320
|
| 76 |
+
}
|
logo.png
ADDED
|
|
Git LFS Details
|
model.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:18c40b223a4c87658afe2e90ae92f0f3370296d14deb88cfc4f91412b4840645
|
| 3 |
+
size 3763692048
|
tokenizer.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:06b9509352d2af50381ab2247e083b80d32d5c0aba91c272ca9ff729b6a0e523
|
| 3 |
+
size 19989325
|
tokenizer_config.json
ADDED
|
@@ -0,0 +1,300 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"add_prefix_space": false,
|
| 3 |
+
"audio_bos_token": "<|audio_start|>",
|
| 4 |
+
"audio_eos_token": "<|audio_end|>",
|
| 5 |
+
"audio_token": "<|audio_pad|>",
|
| 6 |
+
"backend": "tokenizers",
|
| 7 |
+
"bos_token": null,
|
| 8 |
+
"clean_up_tokenization_spaces": false,
|
| 9 |
+
"eos_token": "<|im_end|>",
|
| 10 |
+
"errors": "replace",
|
| 11 |
+
"image_token": "<|image_pad|>",
|
| 12 |
+
"is_local": true,
|
| 13 |
+
"local_files_only": false,
|
| 14 |
+
"model_max_length": 262144,
|
| 15 |
+
"model_specific_special_tokens": {
|
| 16 |
+
"audio_bos_token": "<|audio_start|>",
|
| 17 |
+
"audio_eos_token": "<|audio_end|>",
|
| 18 |
+
"audio_token": "<|audio_pad|>",
|
| 19 |
+
"image_token": "<|image_pad|>",
|
| 20 |
+
"video_token": "<|video_pad|>",
|
| 21 |
+
"vision_bos_token": "<|vision_start|>",
|
| 22 |
+
"vision_eos_token": "<|vision_end|>"
|
| 23 |
+
},
|
| 24 |
+
"pad_token": "<|endoftext|>",
|
| 25 |
+
"padding_side": "left",
|
| 26 |
+
"pretokenize_regex": "(?i:'s|'t|'re|'ve|'m|'ll|'d)|[^\\r\\n\\p{L}\\p{N}]?[\\p{L}\\p{M}]+|\\p{N}| ?[^\\s\\p{L}\\p{M}\\p{N}]+[\\r\\n]*|\\s*[\\r\\n]+|\\s+(?!\\S)|\\s+",
|
| 27 |
+
"split_special_tokens": false,
|
| 28 |
+
"tokenizer_class": "Qwen2Tokenizer",
|
| 29 |
+
"unk_token": null,
|
| 30 |
+
"video_token": "<|video_pad|>",
|
| 31 |
+
"vision_bos_token": "<|vision_start|>",
|
| 32 |
+
"vision_eos_token": "<|vision_end|>",
|
| 33 |
+
"added_tokens_decoder": {
|
| 34 |
+
"248044": {
|
| 35 |
+
"content": "<|endoftext|>",
|
| 36 |
+
"single_word": false,
|
| 37 |
+
"lstrip": false,
|
| 38 |
+
"rstrip": false,
|
| 39 |
+
"normalized": false,
|
| 40 |
+
"special": true
|
| 41 |
+
},
|
| 42 |
+
"248045": {
|
| 43 |
+
"content": "<|im_start|>",
|
| 44 |
+
"single_word": false,
|
| 45 |
+
"lstrip": false,
|
| 46 |
+
"rstrip": false,
|
| 47 |
+
"normalized": false,
|
| 48 |
+
"special": true
|
| 49 |
+
},
|
| 50 |
+
"248046": {
|
| 51 |
+
"content": "<|im_end|>",
|
| 52 |
+
"single_word": false,
|
| 53 |
+
"lstrip": false,
|
| 54 |
+
"rstrip": false,
|
| 55 |
+
"normalized": false,
|
| 56 |
+
"special": true
|
| 57 |
+
},
|
| 58 |
+
"248047": {
|
| 59 |
+
"content": "<|object_ref_start|>",
|
| 60 |
+
"single_word": false,
|
| 61 |
+
"lstrip": false,
|
| 62 |
+
"rstrip": false,
|
| 63 |
+
"normalized": false,
|
| 64 |
+
"special": true
|
| 65 |
+
},
|
| 66 |
+
"248048": {
|
| 67 |
+
"content": "<|object_ref_end|>",
|
| 68 |
+
"single_word": false,
|
| 69 |
+
"lstrip": false,
|
| 70 |
+
"rstrip": false,
|
| 71 |
+
"normalized": false,
|
| 72 |
+
"special": true
|
| 73 |
+
},
|
| 74 |
+
"248049": {
|
| 75 |
+
"content": "<|box_start|>",
|
| 76 |
+
"single_word": false,
|
| 77 |
+
"lstrip": false,
|
| 78 |
+
"rstrip": false,
|
| 79 |
+
"normalized": false,
|
| 80 |
+
"special": true
|
| 81 |
+
},
|
| 82 |
+
"248050": {
|
| 83 |
+
"content": "<|box_end|>",
|
| 84 |
+
"single_word": false,
|
| 85 |
+
"lstrip": false,
|
| 86 |
+
"rstrip": false,
|
| 87 |
+
"normalized": false,
|
| 88 |
+
"special": true
|
| 89 |
+
},
|
| 90 |
+
"248051": {
|
| 91 |
+
"content": "<|quad_start|>",
|
| 92 |
+
"single_word": false,
|
| 93 |
+
"lstrip": false,
|
| 94 |
+
"rstrip": false,
|
| 95 |
+
"normalized": false,
|
| 96 |
+
"special": true
|
| 97 |
+
},
|
| 98 |
+
"248052": {
|
| 99 |
+
"content": "<|quad_end|>",
|
| 100 |
+
"single_word": false,
|
| 101 |
+
"lstrip": false,
|
| 102 |
+
"rstrip": false,
|
| 103 |
+
"normalized": false,
|
| 104 |
+
"special": true
|
| 105 |
+
},
|
| 106 |
+
"248053": {
|
| 107 |
+
"content": "<|vision_start|>",
|
| 108 |
+
"single_word": false,
|
| 109 |
+
"lstrip": false,
|
| 110 |
+
"rstrip": false,
|
| 111 |
+
"normalized": false,
|
| 112 |
+
"special": true
|
| 113 |
+
},
|
| 114 |
+
"248054": {
|
| 115 |
+
"content": "<|vision_end|>",
|
| 116 |
+
"single_word": false,
|
| 117 |
+
"lstrip": false,
|
| 118 |
+
"rstrip": false,
|
| 119 |
+
"normalized": false,
|
| 120 |
+
"special": true
|
| 121 |
+
},
|
| 122 |
+
"248055": {
|
| 123 |
+
"content": "<|vision_pad|>",
|
| 124 |
+
"single_word": false,
|
| 125 |
+
"lstrip": false,
|
| 126 |
+
"rstrip": false,
|
| 127 |
+
"normalized": false,
|
| 128 |
+
"special": true
|
| 129 |
+
},
|
| 130 |
+
"248056": {
|
| 131 |
+
"content": "<|image_pad|>",
|
| 132 |
+
"single_word": false,
|
| 133 |
+
"lstrip": false,
|
| 134 |
+
"rstrip": false,
|
| 135 |
+
"normalized": false,
|
| 136 |
+
"special": true
|
| 137 |
+
},
|
| 138 |
+
"248057": {
|
| 139 |
+
"content": "<|video_pad|>",
|
| 140 |
+
"single_word": false,
|
| 141 |
+
"lstrip": false,
|
| 142 |
+
"rstrip": false,
|
| 143 |
+
"normalized": false,
|
| 144 |
+
"special": true
|
| 145 |
+
},
|
| 146 |
+
"248058": {
|
| 147 |
+
"content": "<tool_call>",
|
| 148 |
+
"single_word": false,
|
| 149 |
+
"lstrip": false,
|
| 150 |
+
"rstrip": false,
|
| 151 |
+
"normalized": false,
|
| 152 |
+
"special": false
|
| 153 |
+
},
|
| 154 |
+
"248059": {
|
| 155 |
+
"content": "</tool_call>",
|
| 156 |
+
"single_word": false,
|
| 157 |
+
"lstrip": false,
|
| 158 |
+
"rstrip": false,
|
| 159 |
+
"normalized": false,
|
| 160 |
+
"special": false
|
| 161 |
+
},
|
| 162 |
+
"248060": {
|
| 163 |
+
"content": "<|fim_prefix|>",
|
| 164 |
+
"single_word": false,
|
| 165 |
+
"lstrip": false,
|
| 166 |
+
"rstrip": false,
|
| 167 |
+
"normalized": false,
|
| 168 |
+
"special": false
|
| 169 |
+
},
|
| 170 |
+
"248061": {
|
| 171 |
+
"content": "<|fim_middle|>",
|
| 172 |
+
"single_word": false,
|
| 173 |
+
"lstrip": false,
|
| 174 |
+
"rstrip": false,
|
| 175 |
+
"normalized": false,
|
| 176 |
+
"special": false
|
| 177 |
+
},
|
| 178 |
+
"248062": {
|
| 179 |
+
"content": "<|fim_suffix|>",
|
| 180 |
+
"single_word": false,
|
| 181 |
+
"lstrip": false,
|
| 182 |
+
"rstrip": false,
|
| 183 |
+
"normalized": false,
|
| 184 |
+
"special": false
|
| 185 |
+
},
|
| 186 |
+
"248063": {
|
| 187 |
+
"content": "<|fim_pad|>",
|
| 188 |
+
"single_word": false,
|
| 189 |
+
"lstrip": false,
|
| 190 |
+
"rstrip": false,
|
| 191 |
+
"normalized": false,
|
| 192 |
+
"special": false
|
| 193 |
+
},
|
| 194 |
+
"248064": {
|
| 195 |
+
"content": "<|repo_name|>",
|
| 196 |
+
"single_word": false,
|
| 197 |
+
"lstrip": false,
|
| 198 |
+
"rstrip": false,
|
| 199 |
+
"normalized": false,
|
| 200 |
+
"special": false
|
| 201 |
+
},
|
| 202 |
+
"248065": {
|
| 203 |
+
"content": "<|file_sep|>",
|
| 204 |
+
"single_word": false,
|
| 205 |
+
"lstrip": false,
|
| 206 |
+
"rstrip": false,
|
| 207 |
+
"normalized": false,
|
| 208 |
+
"special": false
|
| 209 |
+
},
|
| 210 |
+
"248066": {
|
| 211 |
+
"content": "<tool_response>",
|
| 212 |
+
"single_word": false,
|
| 213 |
+
"lstrip": false,
|
| 214 |
+
"rstrip": false,
|
| 215 |
+
"normalized": false,
|
| 216 |
+
"special": false
|
| 217 |
+
},
|
| 218 |
+
"248067": {
|
| 219 |
+
"content": "</tool_response>",
|
| 220 |
+
"single_word": false,
|
| 221 |
+
"lstrip": false,
|
| 222 |
+
"rstrip": false,
|
| 223 |
+
"normalized": false,
|
| 224 |
+
"special": false
|
| 225 |
+
},
|
| 226 |
+
"248068": {
|
| 227 |
+
"content": "<think>",
|
| 228 |
+
"single_word": false,
|
| 229 |
+
"lstrip": false,
|
| 230 |
+
"rstrip": false,
|
| 231 |
+
"normalized": false,
|
| 232 |
+
"special": false
|
| 233 |
+
},
|
| 234 |
+
"248069": {
|
| 235 |
+
"content": "</think>",
|
| 236 |
+
"single_word": false,
|
| 237 |
+
"lstrip": false,
|
| 238 |
+
"rstrip": false,
|
| 239 |
+
"normalized": false,
|
| 240 |
+
"special": false
|
| 241 |
+
},
|
| 242 |
+
"248070": {
|
| 243 |
+
"content": "<|audio_start|>",
|
| 244 |
+
"single_word": false,
|
| 245 |
+
"lstrip": false,
|
| 246 |
+
"rstrip": false,
|
| 247 |
+
"normalized": false,
|
| 248 |
+
"special": true
|
| 249 |
+
},
|
| 250 |
+
"248071": {
|
| 251 |
+
"content": "<|audio_end|>",
|
| 252 |
+
"single_word": false,
|
| 253 |
+
"lstrip": false,
|
| 254 |
+
"rstrip": false,
|
| 255 |
+
"normalized": false,
|
| 256 |
+
"special": true
|
| 257 |
+
},
|
| 258 |
+
"248072": {
|
| 259 |
+
"content": "<tts_pad>",
|
| 260 |
+
"single_word": false,
|
| 261 |
+
"lstrip": false,
|
| 262 |
+
"rstrip": false,
|
| 263 |
+
"normalized": false,
|
| 264 |
+
"special": true
|
| 265 |
+
},
|
| 266 |
+
"248073": {
|
| 267 |
+
"content": "<tts_text_bos>",
|
| 268 |
+
"single_word": false,
|
| 269 |
+
"lstrip": false,
|
| 270 |
+
"rstrip": false,
|
| 271 |
+
"normalized": false,
|
| 272 |
+
"special": true
|
| 273 |
+
},
|
| 274 |
+
"248074": {
|
| 275 |
+
"content": "<tts_text_eod>",
|
| 276 |
+
"single_word": false,
|
| 277 |
+
"lstrip": false,
|
| 278 |
+
"rstrip": false,
|
| 279 |
+
"normalized": false,
|
| 280 |
+
"special": true
|
| 281 |
+
},
|
| 282 |
+
"248075": {
|
| 283 |
+
"content": "<tts_text_bos_single>",
|
| 284 |
+
"single_word": false,
|
| 285 |
+
"lstrip": false,
|
| 286 |
+
"rstrip": false,
|
| 287 |
+
"normalized": false,
|
| 288 |
+
"special": true
|
| 289 |
+
},
|
| 290 |
+
"248076": {
|
| 291 |
+
"content": "<|audio_pad|>",
|
| 292 |
+
"single_word": false,
|
| 293 |
+
"lstrip": false,
|
| 294 |
+
"rstrip": false,
|
| 295 |
+
"normalized": false,
|
| 296 |
+
"special": true
|
| 297 |
+
}
|
| 298 |
+
},
|
| 299 |
+
"chat_template": "{%- set image_count = namespace(value=0) %}\n{%- set video_count = namespace(value=0) %}\n{%- macro render_content(content, do_vision_count, is_system_content=false) %}\n {%- if content is string %}\n {{- content }}\n {%- elif content is iterable and content is not mapping %}\n {%- for item in content %}\n {%- if 'image' in item or 'image_url' in item or item.type == 'image' %}\n {%- if is_system_content %}\n {{- raise_exception('System message cannot contain images.') }}\n {%- endif %}\n {%- if do_vision_count %}\n {%- set image_count.value = image_count.value + 1 %}\n {%- endif %}\n {%- if add_vision_id %}\n {{- 'Picture ' ~ image_count.value ~ ': ' }}\n {%- endif %}\n {{- '<|vision_start|><|image_pad|><|vision_end|>' }}\n {%- elif 'video' in item or item.type == 'video' %}\n {%- if is_system_content %}\n {{- raise_exception('System message cannot contain videos.') }}\n {%- endif %}\n {%- if do_vision_count %}\n {%- set video_count.value = video_count.value + 1 %}\n {%- endif %}\n {%- if add_vision_id %}\n {{- 'Video ' ~ video_count.value ~ ': ' }}\n {%- endif %}\n {{- '<|vision_start|><|video_pad|><|vision_end|>' }}\n {%- elif 'text' in item %}\n {{- item.text }}\n {%- else %}\n {{- raise_exception('Unexpected item type in content.') }}\n {%- endif %}\n {%- endfor %}\n {%- elif content is none or content is undefined %}\n {{- '' }}\n {%- else %}\n {{- raise_exception('Unexpected content type.') }}\n {%- endif %}\n{%- endmacro %}\n{%- if not messages %}\n {{- raise_exception('No messages provided.') }}\n{%- endif %}\n{%- if tools and tools is iterable and tools is not mapping %}\n {{- '<|im_start|>system\\n' }}\n {{- \"# Tools\\n\\nYou have access to the following functions:\\n\\n<tools>\" }}\n {%- for tool in tools %}\n {{- \"\\n\" }}\n {{- tool | tojson }}\n {%- endfor %}\n {{- \"\\n</tools>\" }}\n {{- '\\n\\nIf you choose to call a function ONLY reply in the following format with NO suffix:\\n\\n<tool_call>\\n<function=example_function_name>\\n<parameter=example_parameter_1>\\nvalue_1\\n</parameter>\\n<parameter=example_parameter_2>\\nThis is the value for the second parameter\\nthat can span\\nmultiple lines\\n</parameter>\\n</function>\\n</tool_call>\\n\\n<IMPORTANT>\\nReminder:\\n- Function calls MUST follow the specified format: an inner <function=...></function> block must be nested within <tool_call></tool_call> XML tags\\n- Required parameters MUST be specified\\n- You may provide optional reasoning for your function call in natural language BEFORE the function call, but NOT after\\n- If there is no function call available, answer the question like normal with your current knowledge and do not tell the user about function calls\\n</IMPORTANT>' }}\n {%- if messages[0].role == 'system' %}\n {%- set content = render_content(messages[0].content, false, true)|trim %}\n {%- if content %}\n {{- '\\n\\n' + content }}\n {%- endif %}\n {%- endif %}\n {{- '<|im_end|>\\n' }}\n{%- else %}\n {%- if messages[0].role == 'system' %}\n {%- set content = render_content(messages[0].content, false, true)|trim %}\n {{- '<|im_start|>system\\n' + content + '<|im_end|>\\n' }}\n {%- endif %}\n{%- endif %}\n{%- set ns = namespace(multi_step_tool=true, last_query_index=messages|length - 1) %}\n{%- for message in messages[::-1] %}\n {%- set index = (messages|length - 1) - loop.index0 %}\n {%- if ns.multi_step_tool and message.role == \"user\" %}\n {%- set content = render_content(message.content, false)|trim %}\n {%- if not(content.startswith('<tool_response>') and content.endswith('</tool_response>')) %}\n {%- set ns.multi_step_tool = false %}\n {%- set ns.last_query_index = index %}\n {%- endif %}\n {%- endif %}\n{%- endfor %}\n{%- if ns.multi_step_tool %}\n {{- raise_exception('No user query found in messages.') }}\n{%- endif %}\n{%- for message in messages %}\n {%- set content = render_content(message.content, true)|trim %}\n {%- if message.role == \"system\" %}\n {%- if not loop.first %}\n {{- raise_exception('System message must be at the beginning.') }}\n {%- endif %}\n {%- elif message.role == \"user\" %}\n {{- '<|im_start|>' + message.role + '\\n' + content + '<|im_end|>' + '\\n' }}\n {%- elif message.role == \"assistant\" %}\n {%- set reasoning_content = '' %}\n {%- if message.reasoning_content is string %}\n {%- set reasoning_content = message.reasoning_content %}\n {%- else %}\n {%- if '</think>' in content %}\n {%- set reasoning_content = content.split('</think>')[0].rstrip('\\n').split('<think>')[-1].lstrip('\\n') %}\n {%- set content = content.split('</think>')[-1].lstrip('\\n') %}\n {%- endif %}\n {%- endif %}\n {%- set reasoning_content = reasoning_content|trim %}\n {%- if loop.index0 > ns.last_query_index %}\n {{- '<|im_start|>' + message.role + '\\n<think>\\n' + reasoning_content + '\\n</think>\\n\\n' + content }}\n {%- else %}\n {{- '<|im_start|>' + message.role + '\\n' + content }}\n {%- endif %}\n {%- if message.tool_calls and message.tool_calls is iterable and message.tool_calls is not mapping %}\n {%- for tool_call in message.tool_calls %}\n {%- if tool_call.function is defined %}\n {%- set tool_call = tool_call.function %}\n {%- endif %}\n {%- if loop.first %}\n {%- if content|trim %}\n {{- '\\n\\n<tool_call>\\n<function=' + tool_call.name + '>\\n' }}\n {%- else %}\n {{- '<tool_call>\\n<function=' + tool_call.name + '>\\n' }}\n {%- endif %}\n {%- else %}\n {{- '\\n<tool_call>\\n<function=' + tool_call.name + '>\\n' }}\n {%- endif %}\n {%- if tool_call.arguments is defined %}\n {%- for args_name, args_value in tool_call.arguments|items %}\n {{- '<parameter=' + args_name + '>\\n' }}\n {%- set args_value = args_value | tojson | safe if args_value is mapping or (args_value is sequence and args_value is not string) else args_value | string %}\n {{- args_value }}\n {{- '\\n</parameter>\\n' }}\n {%- endfor %}\n {%- endif %}\n {{- '</function>\\n</tool_call>' }}\n {%- endfor %}\n {%- endif %}\n {{- '<|im_end|>\\n' }}\n {%- elif message.role == \"tool\" %}\n {%- if loop.previtem and loop.previtem.role != \"tool\" %}\n {{- '<|im_start|>user' }}\n {%- endif %}\n {{- '\\n<tool_response>\\n' }}\n {{- content }}\n {{- '\\n</tool_response>' }}\n {%- if not loop.last and loop.nextitem.role != \"tool\" %}\n {{- '<|im_end|>\\n' }}\n {%- elif loop.last %}\n {{- '<|im_end|>\\n' }}\n {%- endif %}\n {%- else %}\n {{- raise_exception('Unexpected message role.') }}\n {%- endif %}\n{%- endfor %}\n{%- if add_generation_prompt %}\n {{- '<|im_start|>assistant\\n' }}\n {%- if enable_thinking is defined and enable_thinking is true %}\n {{- '<think>\\n' }}\n {%- else %}\n {{- '<think>\\n\\n</think>\\n\\n' }}\n {%- endif %}\n{%- endif %}"
|
| 300 |
+
}
|