# palmer
palmer-003 aims for state-of-the-art performance through MErging of Experts plus fine-tuning: the experts are consolidated into a single model, which is then fine-tuned on useful textual data.
### Evaluation
```
Model           | ARC-C  | OBQA   | HellaSwag | PIQA   | Winogrande | Average |
tinyllama       | 0.3029 | 0.3600 | 0.5935    | 0.7329 | 0.5959     | 0.5170  |
palmer-002-2401 | 0.3311 | 0.3600 | 0.5981    | 0.7416 | 0.6006     | 0.5266  |
babbage-002     | 0.3285 | 0.3620 | 0.6380    | 0.7606 | 0.6085     | 0.5395  |
palmer-003      | 0.3370 | 0.3740 | 0.6128    | 0.7486 | 0.6535     | 0.5451  |
```
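
As a quick sanity check on the table, the Average column can be recomputed from the five task scores. The sketch below is not part of the original evaluation; it assumes Average is the unweighted mean of the five benchmarks, and the names `scores`, `reported`, and `mean` are illustrative. Recomputed values may differ from the reported ones in the last digit, presumably because the reported averages were taken before rounding.

```python
# Per-task scores copied from the evaluation table
# (order: ARC-C, OBQA, HellaSwag, PIQA, Winogrande).
scores = {
    "tinyllama":       [0.3029, 0.3600, 0.5935, 0.7329, 0.5959],
    "palmer-002-2401": [0.3311, 0.3600, 0.5981, 0.7416, 0.6006],
    "babbage-002":     [0.3285, 0.3620, 0.6380, 0.7606, 0.6085],
    "palmer-003":      [0.3370, 0.3740, 0.6128, 0.7486, 0.6535],
}

# Average column as reported in the table.
reported = {
    "tinyllama": 0.5170,
    "palmer-002-2401": 0.5266,
    "babbage-002": 0.5395,
    "palmer-003": 0.5451,
}


def mean(values):
    """Unweighted arithmetic mean (assumed to be how Average was computed)."""
    return sum(values) / len(values)


recomputed = {name: mean(vals) for name, vals in scores.items()}

for name, avg in recomputed.items():
    print(f"{name:16s} recomputed={avg:.4f} reported={reported[name]:.4f}")

# Model with the highest recomputed average.
best = max(recomputed, key=recomputed.get)
print("best by average:", best)
```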