Update README.md
README.md
CHANGED
@@ -18,6 +18,8 @@ base_model:
 
 # Daredevil-8B
 
+**tl;dr: It looks like a successful merge**
+
 Daredevil-8B is a merge of the following models using [LazyMergekit](https://colab.research.google.com/drive/1obulZ1ROXHjYLn6PPZJwRR6GzgQogxxb?usp=sharing):
 * [nbeerbower/llama-3-stella-8B](https://huggingface.co/nbeerbower/llama-3-stella-8B)
 * [Hastagaras/llama-3-8b-okay](https://huggingface.co/Hastagaras/llama-3-8b-okay)
@@ -29,6 +31,15 @@ Daredevil-8B is a merge of the following models using [LazyMergekit](https://col
 * [flammenai/Mahou-1.1-llama3-8B](https://huggingface.co/flammenai/Mahou-1.1-llama3-8B)
 * [KingNish/KingNish-Llama3-8b](https://huggingface.co/KingNish/KingNish-Llama3-8b)
 
+## 🏆 Evaluation
+
+| Model | Average | AGIEval | GPT4All | TruthfulQA | Bigbench |
+| ------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | --------: | --------: | --------: | ---------: | --------: |
+| [**mlabonne/Daredevil-8B**](https://huggingface.co/mlabonne/Daredevil-8B) [📃](https://gist.github.com/mlabonne/080f9c5f153ea57a7ab7d932cf896f21) | **55.87** | **44.13** | **73.52** | **59.05** | **46.77** |
+| [mlabonne/ChimeraLlama-3-8B-](https://huggingface.co/mlabonne/Chimera-8B) [📃](https://gist.github.com/mlabonne/28d31153628dccf781b74f8071c7c7e4) | 51.58 | 39.12 | 71.81 | 52.4 | 42.98 |
+| [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct) [📃](https://gist.github.com/mlabonne/8329284d86035e6019edb11eb0933628) | 51.34 | 41.22 | 69.86 | 51.65 | 42.64 |
+| [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) [📃](https://gist.github.com/mlabonne/616b6245137a9cfc4ea80e4c6e55d847) | 45.42 | 31.1 | 69.95 | 43.91 | 36.7 |
+
 ## 🧩 Configuration
 
 ```yaml