Proactive-Interactive-R1/Proactive-Interactive-R1-SFT-7B

Xinging committed 5 days ago

Commit 1eb226b · verified · 1 Parent(s): 0a19de1

Update README.md

Files changed (1)

README.md +7 -3

@@ -1,6 +1,6 @@
 ---
 library_name: transformers
-license: other
 base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
 tags:
 - llama-factory
@@ -9,6 +9,10 @@ tags:
 model-index:
 - name: Proactive-Interactive-R1-SFT-7B
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 # Proactive-Interactive-R1-SFT-7B
-This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B) on the Proactive-Interactive-R1-SFT-7B dataset.
 ## Model description
@@ -58,4 +62,4 @@ The following hyperparameters were used during training:
 - Transformers 4.55.0
 - Pytorch 2.8.0+cu128
 - Datasets 3.6.0
-- Tokenizers 0.21.1

 ---
 library_name: transformers
+license: apache-2.0
 base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
 tags:
 - llama-factory
 model-index:
 - name: Proactive-Interactive-R1-SFT-7B
   results: []
+datasets:
+- Proactive-Interactive-R1/Reasoning-While-Asking-SFT-Dataset
+language:
+- en
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # Proactive-Interactive-R1-SFT-7B
+This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B) on the Reasoning-While-Asking-SFT-Dataset dataset.
 ## Model description
 - Transformers 4.55.0
 - Pytorch 2.8.0+cu128
 - Datasets 3.6.0
+- Tokenizers 0.21.1