Update README.md
Browse files
README.md
CHANGED
|
@@ -28,12 +28,29 @@ Training methodology ... is a bit of a mess, trying out different things.
|
|
| 28 |
I'm adding the datasets used at any point, but I don't think replicating the recipe is doable or sensible.
|
| 29 |
|
| 30 |
Original upscale at Lambent/danube2-upscale-1, duplicating layers 16-21. Various training methods attempted to repair.
|
| 31 |
-
|
| 32 |
|
| 33 |
| Model |AGIEval|GPT4All|TruthfulQA|Bigbench|Average|
|
| 34 |
|-------------------------------------------------------------------------|------:|------:|---------:|-------:|------:|
|
| 35 |
|[danube2-upscale-1.7](https://huggingface.co/Lambent/danube2-upscale-1.7)| 27.97| 62.16| 42.2| 32.2| 41.13|
|
| 36 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 37 |
## Merge Details
|
| 38 |
### Merge Method
|
| 39 |
|
|
|
|
| 28 |
I'm adding the datasets used at any point, but I don't think replicating the recipe is doable or sensible.
|
| 29 |
|
| 30 |
Original upscale at Lambent/danube2-upscale-1, duplicating layers 16-21. Various training methods attempted to repair.
|
| 31 |
+
Linear merge is of the 4 that were at least 90% parseable by the EQ-Bench benchmark.
|
| 32 |
|
| 33 |
| Model |AGIEval|GPT4All|TruthfulQA|Bigbench|Average|
|
| 34 |
|-------------------------------------------------------------------------|------:|------:|---------:|-------:|------:|
|
| 35 |
|[danube2-upscale-1.7](https://huggingface.co/Lambent/danube2-upscale-1.7)| 27.97| 62.16| 42.2| 32.2| 41.13|
|
| 36 |
|
| 37 |
+
| Model |EQ-Bench|Average|
|
| 38 |
+
|-------------------------------------------------------------------------|-------:|------:|
|
| 39 |
+
|[danube2-upscale-1.7](https://huggingface.co/Lambent/danube2-upscale-1.7)| 15.52| 15.52|
|
| 40 |
+
|
| 41 |
+
### EQ-Bench
|
| 42 |
+
| Task |Version| Metric | Value | |Stderr|
|
| 43 |
+
|--------|------:|-----------------------------|--------|---|------|
|
| 44 |
+
|eq_bench| 2.1|eqbench,none | 15.52| | |
|
| 45 |
+
| | |eqbench_stderr,none | 2.77| | |
|
| 46 |
+
| | |percent_parseable,none | 100| | |
|
| 47 |
+
| | |percent_parseable_stderr,none| 0| | |
|
| 48 |
+
| | |alias |eq_bench| | |
|
| 49 |
+
|
| 50 |
+
Average: 15.52%
|
| 51 |
+
|
| 52 |
+
Average score: 15.52%
|
| 53 |
+
|
| 54 |
## Merge Details
|
| 55 |
### Merge Method
|
| 56 |
|