Update README.md
Browse files
README.md
CHANGED
@@ -102,8 +102,6 @@ The table below summarizes the evaluation results across mathematical tasks and
|
|
102 |
| Hybrid Expansion | 34.4 | 61.1 | 90.1 | 61.5 | | **81.8** | **25.9**| 67.2 | **43.9** | 59.3 |
|
103 |
| **Control LLM*** | 38.1 | 62.7 | **90.4** | 63.2 | | 79.7 | 25.2 | **68.1**| 43.6 | **60.2** |
|
104 |
|
105 |
-
---
|
106 |
-
|
107 |
### Explanation of Groups
|
108 |
- **Math Tasks**:
|
109 |
- Covers **MathHard**, **Math**, and **GSM8K**, measuring the model's performance on mathematical reasoning and problem-solving tasks.
|
|
|
102 |
| Hybrid Expansion | 34.4 | 61.1 | 90.1 | 61.5 | | **81.8** | **25.9**| 67.2 | **43.9** | 59.3 |
|
103 |
| **Control LLM*** | 38.1 | 62.7 | **90.4** | 63.2 | | 79.7 | 25.2 | **68.1**| 43.6 | **60.2** |
|
104 |
|
|
|
|
|
105 |
### Explanation of Groups
|
106 |
- **Math Tasks**:
|
107 |
- Covers **MathHard**, **Math**, and **GSM8K**, measuring the model's performance on mathematical reasoning and problem-solving tasks.
|