Update README.md
README.md
CHANGED
@@ -23,7 +23,7 @@ tags:
 
 ProX models are evaluated over 10 language model benchmarks in zero-shot setting.
 
-| |
+| | ArC-c | ARC-e | CSQA | HellaS | MMLU | OBQA | PiQA | SIQA | WinoG | SciQ | AVG |
 |-----------------------|-------|-------|-------|-----------|-------|-------|-------|-------|-------|-------|------|
 | raw | 22.6% | 41.9% | 29.7% | 32.8% | 26.2% | 26.4% | 62.2% | 39.3% | 51.3% | 63.3% | 39.6 |
 | ours | 25.9% | 47.5% | 29.2% | 36.7% | 28.1% | 30.2% | 64.6% | 38.0% | 51.7% | 71.4% | 42.3 |