Update README.md
README.md
CHANGED
@@ -8,24 +8,25 @@ library_name: transformers
 tags:
 - juanako
 - UNA
-- Mistral
 ---
 # Model Card for una-cybertron-7b-v1 (UNA: Uniform Neural Alignment)
 
 We strike back, introducing **Cybertron 7B** a Mistral based model best on it's series. Trained on SFT, DPO and UNA (Unified Neural Alignment) on multiple datasets.
-He scores **64.
+He scores **64.60** on HF LeaderTests (without DROP for now).
+
+Once again scoring #1 at 2 December 2023:
 
 | Model | Average | ARC (25-s) | HellaSwag (10-s) | MMLU (5-s) | TruthfulQA (MC) (0-s) | Winogrande (5-s) | GSM8K (5-s) |
 | --- | --- | --- | --- | --- | --- | --- | --- |
 | [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) | 60.97 | 59.98 | 83.31 | 64.16 | 42.15 | 78.37 | 37.83 |
 | [perlthoughts/Chupacabra-7B-v2](https://huggingface.co/perlthoughts/Chupacabra-7B-v2) | 63.54 | 66.47 | 85.17 | 64.49 | 57.6 | 79.16 | 28.35 |
-| [fblgit/una-cybertron-7b-v1](https://huggingface.co/fblgit/una-cybertron-7b-v1) | **64.
+| [fblgit/una-cybertron-7b-v1](https://huggingface.co/fblgit/una-cybertron-7b-v1) | **64.60** | **68.17** | 85.14 | 62.07 | **63.98** | **80.9** | 27.34 |
 
 The model excels in mathematics, logic, reasoning, overall very smart.
 
 ## Model Details
 
-Adiestrated with UNA: Uniform Neural Alignment technique (paper going out soon)
+Adiestrated with UNA: Uniform Neural Alignment technique (paper going out soon).
 
 ### Model Description
 