QuantumStackOverflow committed on
Commit
ace14f6
·
verified ·
1 Parent(s): 1f511e1

Update README.md

Files changed (1)
  1. README.md +70 -3
README.md CHANGED
@@ -1,3 +1,70 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ base_model: Qwen/Qwen3-4B-Thinking-2507
+ tags:
+ - aster
+ - reinforcement-learning
+ - sft
+ - reproduction
+ metrics:
+ - accuracy
+ model-index:
+ - name: ASTER_4B
+   results:
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: AIME 2025
+       type: aime2025
+     metrics:
+     - name: Accuracy
+       type: accuracy
+       value: 87.7
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: HMMT 2025 Feb
+       type: hmmt_2025_feb
+     metrics:
+     - name: Accuracy
+       type: accuracy
+       value: 77.1
+ ---
+
+ # ASTER_4B (Independent Reproduction)
+
+ [![Paper](https://img.shields.io/badge/Paper-ArXiv.2602.01204-B31B1B.svg)](https://arxiv.org/pdf/2602.01204)
+ [![GitHub](https://img.shields.io/badge/GitHub-Reproduction_Code-black)](https://github.com/Rainyrou/ASTER)
+ [![License](https://img.shields.io/badge/License-Apache_2.0-green.svg)](https://huggingface.co/datasets/choosealicense/licenses/apache-2.0)
+
+ ## Model Description
+
+ **ASTER_4B** is an independent reproduction of the ASTER framework. The model is fine-tuned from [Qwen/Qwen3-4B-Thinking-2507](https://huggingface.co/Qwen/Qwen3-4B-Thinking-2507), strictly adhering to the experimental details and hyperparameter settings described in the original ASTER paper.
+
+ > ⚠️ **Note:** This is a **reproduction project**. We aim to verify the effectiveness of the ASTER method by strictly following the official paper's details.
+
+ ## Training Data (SFT)
+
+ The model was trained on our reproduced dataset, **ASTER_SFT4K**.
+
+ This dataset is a tiny yet effective SFT set, constructed to replicate the exact data distribution and formatting used in the original ASTER experiments. Dataset details are available here:
+ * **Dataset Repo:** [ASTER_SFT4K](https://huggingface.co/datasets/QuantumStackOverflow/ASTER_SFT4K)
+
+ ## Evaluation Results
+
+ We evaluated the model on two challenging mathematical benchmarks, AIME 2025 and HMMT 2025 (Feb). The evaluation used the **exact generation configuration** specified in the ASTER paper to ensure a fair comparison.
+
+ **Generation Config:**
+ * **Temperature:** `1.0`
+ * **Top_p:** `1.0`
+ * **Max_context_length:** `96256`
+
+ | Benchmark | Score (%) |
+ | :--- | :--- |
+ | **AIME 2025** | **87.7** |
+ | **HMMT 2025 (Feb)** | **77.1** |
+
+
+
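
The generation config listed in the README can be captured as a small settings object. The following is a minimal sketch, not the authors' evaluation code: `GenerationSettings`, `max_new_tokens`, and `sampling_kwargs` are hypothetical names, and it assumes that `Max_context_length` bounds the prompt and the generated tokens together, which the README does not state.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class GenerationSettings:
    """Sampling values copied from the README's Generation Config list."""
    temperature: float = 1.0
    top_p: float = 1.0
    max_context_length: int = 96256


def max_new_tokens(prompt_tokens: int, settings: GenerationSettings) -> int:
    """Token budget left for generation.

    Assumption: max_context_length covers prompt + generated tokens.
    """
    remaining = settings.max_context_length - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt already fills the context window")
    return remaining


def sampling_kwargs(prompt_tokens: int, settings: GenerationSettings) -> dict:
    """Build sampling keyword arguments for a prompt of the given length."""
    return {
        "do_sample": True,  # temperature/top_p only apply when sampling
        "temperature": settings.temperature,
        "top_p": settings.top_p,
        "max_new_tokens": max_new_tokens(prompt_tokens, settings),
    }
```

The dictionary keys are chosen to match keyword arguments accepted by the Hugging Face `transformers` `generate` method, so a call could look like `model.generate(**inputs, **sampling_kwargs(n_prompt_tokens, GenerationSettings()))`, where `model`, `inputs`, and `n_prompt_tokens` are placeholders for objects created elsewhere.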