linzhao-amd commited on
Commit
8c1b558
·
verified ·
1 Parent(s): 566ab5a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +90 -0
README.md CHANGED
@@ -50,5 +50,95 @@ python3 quantize_quark.py --model_dir $MODEL_DIR \
50
 
51
  This model can be deployed efficiently using the [SGLang](https://docs.sglang.ai/) backend.
52
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
53
  # License
54
  Modifications Copyright(c) 2025 Advanced Micro Devices, Inc. All rights reserved.
 
50
 
51
  This model can be deployed efficiently using the [SGLang](https://docs.sglang.ai/) backend.
52
 
53
+ ## Evaluation
54
+
55
+ The model was evaluated using [SGLang](https://docs.sglang.ai/) and [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) frameworks.
56
+
57
+ ### Accuracy
58
+
59
+ <table>
60
+ <tr>
61
+ <td><strong>Benchmark</strong>
62
+ </td>
63
+ <td><strong>DeepSeek-R1 </strong>
64
+ </td>
65
+ <td><strong>DeepSeek-R1-MXFP4-ASQ(this model)</strong>
66
+ </td>
67
+ <td><strong>Recovery</strong>
68
+ </td>
69
+ </tr>
70
+ <tr>
71
+ <td>AIME24
72
+ </td>
73
+ <td>76.0
74
+ </td>
75
+ <td>78.0
76
+ </td>
77
+ <td>97.44%
78
+ </td>
79
+ </tr>
80
+ <tr>
81
+ <td>MMLU_COT
82
+ </td>
83
+ <td>79.90
84
+ </td>
85
+ <td>79.65
86
+ </td>
87
+ <td>99.69%
88
+ </td>
89
+ </tr>
90
+ <tr>
91
+ <td>GSM8K
92
+ </td>
93
+ <td>95.81
94
+ </td>
95
+ <td>95.42
96
+ </td>
97
+ <td>99.59%
98
+ </td>
99
+ </tr>
100
+ </table>
101
+
102
+
103
+ ### Reproduction
104
+
105
+ The results were obtained using [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) with custom evaluation tasks AIME24 and MMLU_CoT.
106
+
107
+ ### AIME24
108
+ ```
109
+ lm_eval --model local-completions \
110
+ --model_args model=amd/DeepSeek-R1-MXFP4-ASQ,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
111
+ --tasks aime24 \
112
+ --num_fewshot 0 \
113
+ --gen_kwargs "do_sample=True,temperature=0.6,top_p=0.95,max_tokens=32000" \
114
+ --batch_size auto \
115
+ --log_samples \
116
+ --output_path output_data/aime24 2>&1 | tee logs/aime24.log
117
+ ```
118
+
119
+ ### MMLU_COT
120
+ ```
121
+ lm_eval --model local-completions \
122
+ --model_args model=amd/DeepSeek-R1-MXFP4-ASQ,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
123
+ --tasks mmlu_cot \
124
+ --num_fewshot 0 \
125
+ --gen_kwargs "do_sample=True,temperature=0.6,top_p=0.95,max_tokens=32000" \
126
+ --batch_size auto \
127
+ --log_samples \
128
+ --output_path output_data/mmmlu_cot 2>&1 | tee logs/mmmlu_cot.log
129
+ ```
130
+
131
+ ### GSM8K
132
+ ```
133
+ lm_eval --model local-completions \
134
+ --model_args model=amd/DeepSeek-R1-MXFP4-ASQ,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=8096 \
135
+ --tasks gsm8k \
136
+ --num_fewshot 5 \
137
+ --batch_size auto \
138
+ --log_samples \
139
+ --output_path output_data/gsm8k 2>&1 | tee logs/gsm8k.log
140
+ ```
141
+
142
+
143
  # License
144
  Modifications Copyright(c) 2025 Advanced Micro Devices, Inc. All rights reserved.