Commit 1eb226b · verified · committed by Xinging · 1 Parent(s): 0a19de1

Update README.md

Files changed (1)
  1. README.md +7 -3
README.md CHANGED
@@ -1,6 +1,6 @@
  ---
  library_name: transformers
- license: other
+ license: apache-2.0
  base_model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
  tags:
  - llama-factory
@@ -9,6 +9,10 @@ tags:
  model-index:
  - name: Proactive-Interactive-R1-SFT-7B
  results: []
+ datasets:
+ - Proactive-Interactive-R1/Reasoning-While-Asking-SFT-Dataset
+ language:
+ - en
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -16,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  # Proactive-Interactive-R1-SFT-7B
 
- This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B) on the Proactive-Interactive-R1-SFT-7B dataset.
+ This model is a fine-tuned version of [deepseek-ai/DeepSeek-R1-Distill-Qwen-7B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-7B) on the Reasoning-While-Asking-SFT-Dataset dataset.
 
  ## Model description
 
@@ -58,4 +62,4 @@ The following hyperparameters were used during training:
  - Transformers 4.55.0
  - Pytorch 2.8.0+cu128
  - Datasets 3.6.0
- - Tokenizers 0.21.1
+ - Tokenizers 0.21.1