A gradient merge of Stheno-P1 and Stheno-P2, made with a version of the BlockMerge_Gradient script modified by @Vali to use SLERP for the tensor calculations.
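
For readers unfamiliar with the idea, here is a minimal NumPy sketch of a gradient merge that uses SLERP. It is not the script @Vali modified, and it makes assumptions: the helper names (`slerp`, `layer_ratios`, `gradient_slerp_merge`) are hypothetical, the "gradient" is taken to be a list of anchor blend ratios spread linearly across the layers, and each layer is reduced to a single array.

```python
import numpy as np


def slerp(t, a, b, eps=1e-8):
    """Spherical linear interpolation between two same-shaped arrays.

    t = 0 returns a, t = 1 returns b. Falls back to plain linear
    interpolation when either array is near zero or the two are
    nearly parallel, where the SLERP weights become unstable.
    """
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    a_flat, b_flat = a.ravel(), b.ravel()
    a_norm, b_norm = np.linalg.norm(a_flat), np.linalg.norm(b_flat)
    if a_norm < eps or b_norm < eps:
        return (1.0 - t) * a + t * b
    # Angle between the two flattened tensors.
    cos_omega = np.clip(np.dot(a_flat / a_norm, b_flat / b_norm), -1.0, 1.0)
    omega = np.arccos(cos_omega)
    sin_omega = np.sin(omega)
    if sin_omega < eps:
        return (1.0 - t) * a + t * b
    weight_a = np.sin((1.0 - t) * omega) / sin_omega
    weight_b = np.sin(t * omega) / sin_omega
    return (weight_a * a_flat + weight_b * b_flat).reshape(a.shape)


def layer_ratios(num_layers, gradient_values):
    """Spread anchor blend ratios evenly over the layers (assumed scheme)."""
    anchors = np.linspace(0, num_layers - 1, num=len(gradient_values))
    return np.interp(np.arange(num_layers), anchors, gradient_values)


def gradient_slerp_merge(layers_a, layers_b, gradient_values):
    """Merge two lists of per-layer arrays, one blend ratio per layer."""
    ratios = layer_ratios(len(layers_a), gradient_values)
    return [slerp(t, a, b) for t, a, b in zip(ratios, layers_a, layers_b)]


# Toy stand-ins for the two parent models: 5 "layers" of 4x4 weights each.
rng = np.random.default_rng(0)
model_a = [rng.normal(size=(4, 4)) for _ in range(5)]
model_b = [rng.normal(size=(4, 4)) for _ in range(5)]
merged = gradient_slerp_merge(model_a, model_b, gradient_values=[0.0, 1.0])
```

With `gradient_values=[0.0, 1.0]`, the first layer comes entirely from the first model, the last layer entirely from the second, and the layers in between shift gradually from one to the other.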

So far it's pretty good in personal tests.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	49.32
ARC (25-shot)	56.83
HellaSwag (10-shot)	81.7
MMLU (5-shot)	52.79
TruthfulQA (0-shot)	50.23
Winogrande (5-shot)	71.11
GSM8K (5-shot)	0.23
DROP (3-shot)	32.34
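
As a quick sanity check, the Avg. figure matches the unweighted mean of the seven benchmark scores above. The snippet below is only a sketch that assumes this is how the leaderboard computes it; the dictionary name `scores` is illustrative.

```python
# Benchmark scores copied from the table above.
scores = {
    "ARC (25-shot)": 56.83,
    "HellaSwag (10-shot)": 81.7,
    "MMLU (5-shot)": 52.79,
    "TruthfulQA (0-shot)": 50.23,
    "Winogrande (5-shot)": 71.11,
    "GSM8K (5-shot)": 0.23,
    "DROP (3-shot)": 32.34,
}

# Unweighted mean over all seven benchmarks.
average = sum(scores.values()) / len(scores)
print(round(average, 2))  # 49.32
```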

