fblgit commited on
Commit
3e12f69
1 Parent(s): aee9280

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -122,6 +122,11 @@ Here are some results:
122
  * Scores #6 in CoPa
123
  * Scores #2 in PiQA
124
  * Scores #9 in BoolQ
 
 
 
 
 
125
 
126
  Many evaluations were performed, but it behaves very balanced in multiple fields. Feel free to submit more evaluation results.
127
 
 
122
  * Scores #6 in CoPa
123
  * Scores #2 in PiQA
124
  * Scores #9 in BoolQ
125
+ | Model | Average ⬆️| ARC (25-s) ⬆️ | HellaSwag (10-s) ⬆️ | MMLU (5-s) ⬆️| TruthfulQA (MC) (0-s) ⬆️ | Winogrande (5-s) | GSM8K (5-s) | DROP (3-s) |
126
+ | --- | --- | --- | --- | --- | --- | --- | --- | --- |
127
+ |[mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) | 50.32 | 59.58 | 83.31 | 64.16 | 42.15 | 78.37 | 18.12 | 6.14 |
128
+ | [Intel/neural-chat-7b-v3-1](https://huggingface.co/Intel/neural-chat-7b-v3-1) | 59.0 | 66.21 | 83.64 | 62.37 | 59.65 | 78.14 | 19.56 | 43.84 |
129
+ | [fblgit/juanako-7b-UNA](https://huggingface.co/fblgit/juanako-7b-UNA) | **65.10** | **68.09** | **85.20** | 61.37 | **65.49** | 76.8 | **48.98** | **49.8** |
130
 
131
  Many evaluations were performed, but it behaves very balanced in multiple fields. Feel free to submit more evaluation results.
132