Update README.md
Browse files
README.md
CHANGED
@@ -14,6 +14,10 @@ tags:
|
|
14 |
|
15 |
# Evaluations
|
16 |
|
|
|
|
|
|
|
|
|
17 |
|
18 |
### Model Evaluation Benchmark
|
19 |
|
@@ -23,13 +27,6 @@ tags:
|
|
23 |
| LLAMA-2-7b | 43.2 | **77.1** | 44.4 | 38.7 | 69.5 | 16 |
|
24 |
| MT7Bi (1 epoch) | 50.94 | 73.24 | - | 43.04 | 72.06 | 22.52 |
|
25 |
|
26 |
-
|
27 |
-
## Open LLM Leaderboard
|
28 |
-
|
29 |
-
| Model | ARC |HellaSwag| MMLU |TruthfulQA|Winogrande|GSM8K|
|
30 |
-
|---------------------------------------------------|----:|--------:|--------------------------|---------:|---------:|----:|
|
31 |
-
|[MT7Bi](https://huggingface.co/Technoculture/MT7Bi)|50.94| 73.24|Error: File does not exist| 43.04| 72.06|22.52|
|
32 |
-
|
33 |
### ARC: 50.94%
|
34 |
| Task |Version| Metric | Value | |Stderr|
|
35 |
|-------------|-------|--------------------|-------------|---|------|
|
|
|
14 |
|
15 |
# Evaluations
|
16 |
|
17 |
+
## Open LLM Leaderboard
|
18 |
+
| Model | ARC |HellaSwag| MMLU |TruthfulQA|Winogrande|GSM8K|
|
19 |
+
|---------------------------------------------------|----:|--------:|--------------------------|---------:|---------:|----:|
|
20 |
+
|[MT7Bi](https://huggingface.co/Technoculture/MT7Bi)|50.94| 73.24|Error: File does not exist| 43.04| 72.06|22.52|
|
21 |
|
22 |
### Model Evaluation Benchmark
|
23 |
|
|
|
27 |
| LLAMA-2-7b | 43.2 | **77.1** | 44.4 | 38.7 | 69.5 | 16 |
|
28 |
| MT7Bi (1 epoch) | 50.94 | 73.24 | - | 43.04 | 72.06 | 22.52 |
|
29 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
30 |
### ARC: 50.94%
|
31 |
| Task |Version| Metric | Value | |Stderr|
|
32 |
|-------------|-------|--------------------|-------------|---|------|
|