Locutusque commited on
Commit
2947447
1 Parent(s): c12978b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +28 -1
README.md CHANGED
@@ -37,7 +37,34 @@ This model is intended for researchers and practitioners looking for a powerful
37
  The `Locutusque/Hyperion-2.0-Mistral-7B` model was fine-tuned on the Hyperion-v2.0 dataset, which amalgamates various datasets rich in diversity and complexity, including programming, medical texts, mathematical problems, and reasoning tasks.
38
 
39
  ## Evaluation Results
40
- Coming soon...
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
41
 
42
  ## How to Use
43
  ```python
 
37
  The `Locutusque/Hyperion-2.0-Mistral-7B` model was fine-tuned on the Hyperion-v2.0 dataset, which amalgamates various datasets rich in diversity and complexity, including programming, medical texts, mathematical problems, and reasoning tasks.
38
 
39
  ## Evaluation Results
40
+ 0-shot AGIEval
41
+ | Tasks |Version|Filter|n-shot| Metric |Value | |Stderr|
42
+ |---------------------------------|-------|------|-----:|--------|-----:|---|-----:|
43
+ |agieval_nous |N/A |none | 0|acc |0.3602|± |0.0929|
44
+ | | |none | 0|acc_norm|0.3342|± |0.0764|
45
+ | - agieval_aqua_rat | 1|none | 0|acc |0.2402|± |0.0269|
46
+ | | |none | 0|acc_norm|0.2441|± |0.0270|
47
+ | - agieval_logiqa_en | 1|none | 0|acc |0.2965|± |0.0179|
48
+ | | |none | 0|acc_norm|0.3226|± |0.0183|
49
+ | - agieval_lsat_ar | 1|none | 0|acc |0.2348|± |0.0280|
50
+ | | |none | 0|acc_norm|0.2000|± |0.0264|
51
+ | - agieval_lsat_lr | 1|none | 0|acc |0.3667|± |0.0214|
52
+ | | |none | 0|acc_norm|0.3373|± |0.0210|
53
+ | - agieval_lsat_rc | 1|none | 0|acc |0.4981|± |0.0305|
54
+ | | |none | 0|acc_norm|0.4089|± |0.0300|
55
+ | - agieval_sat_en | 1|none | 0|acc |0.6359|± |0.0336|
56
+ | | |none | 0|acc_norm|0.5777|± |0.0345|
57
+ | - agieval_sat_en_without_passage| 1|none | 0|acc |0.3883|± |0.0340|
58
+ | | |none | 0|acc_norm|0.3544|± |0.0334|
59
+ | - agieval_sat_math | 1|none | 0|acc |0.3500|± |0.0322|
60
+ | | |none | 0|acc_norm|0.2682|± |0.0299|
61
+
62
+ | Groups |Version|Filter|n-shot| Metric |Value | |Stderr|
63
+ |------------|-------|------|-----:|--------|-----:|---|-----:|
64
+ |agieval_nous|N/A |none | 0|acc |0.3602|± |0.0929|
65
+ | | |none | 0|acc_norm|0.3342|± |0.0764|
66
+
67
+ 5-shot AGIEval coming soon.
68
 
69
  ## How to Use
70
  ```python