appvoid commited on
Commit
da4809f
·
verified ·
1 Parent(s): 6233383

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -24,14 +24,14 @@ arco consistently outperforms every sota model below 600m parameters on average,
24
 
25
  #### benchmarks
26
 
27
- zero-shot evaluations performed on current sota ~0.5b models and palmer-004.
28
 
29
  | Parameters | Model | MMLU | ARC-C | HellaSwag | PIQA | Winogrande | Average |
30
  | -----------|--------------------------------|-------|-------|-----------|--------|------------|---------|
31
- | 0.5b | qwen2 |**0.4413**| 0.2892| 0.4905 | 0.6931 | 0.5699 | 0.4968 |
32
- | 0.5b | palmer-004-turbo |0.2736|0.3558|0.6179|0.7367 | 0.6117 |0.5191|
33
- | 1.1b | palmer-004 | 0.2661| 0.3490| 0.6173 |**0.7481**|**0.6417** |0.5244|
34
- | 0.5b | arco |0.2617|**0.3729**|**0.6288**|0.7437| 0.6227 |**0.5260**|
35
  #### supporters
36
 
37
  <a href="https://ko-fi.com/appvoid" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" style="height: 34px !important; margin-top: -4px;width: 128px !important; filter: contrast(2) grayscale(100%) brightness(100%);" ></a>
 
24
 
25
  #### benchmarks
26
 
27
+ zero-shot evaluations performed on current sota ~0.5b models.
28
 
29
  | Parameters | Model | MMLU | ARC-C | HellaSwag | PIQA | Winogrande | Average |
30
  | -----------|--------------------------------|-------|-------|-----------|--------|------------|---------|
31
+ | 0.5b | qwen2 |44.13| 28.92| 49.05 | 69.31 | 56.99 | 49.68 |
32
+ | 0.5b | danube3-500m | 24.81| 36.18| 60.46| 73.78 | 61.01 | 51.25 |
33
+ | 0.5b | palmer-004-turbo |27.36|35.58|61.79|73.67 | 61.17 |51.91|
34
+ | 0.5b | arco |26.17|**37.29**|**62.88**|74.37| 62.27 |**52.60**|
35
  #### supporters
36
 
37
  <a href="https://ko-fi.com/appvoid" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" style="height: 34px !important; margin-top: -4px;width: 128px !important; filter: contrast(2) grayscale(100%) brightness(100%);" ></a>