giraffe176 commited on
Commit
1d43d0c
1 Parent(s): 6851f8e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +120 -0
README.md CHANGED
@@ -10,6 +10,126 @@ tags:
10
  - mergekit
11
  - merge
12
  license: cc-by-nc-4.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  ---
14
  # WestMaid_HermesMonarchv0.1
15
 
 
10
  - mergekit
11
  - merge
12
  license: cc-by-nc-4.0
13
+ model-index:
14
+ - name: WestLake_Noromaid_OpenHermes_neural-chatv0.1
15
+ results:
16
+ - task:
17
+ type: text-generation
18
+ name: Text Generation
19
+ dataset:
20
+ name: EQ-Bench
21
+ type: eq-bench
22
+ config: EQ-Bench
23
+ split: v2
24
+ args:
25
+ num_few_shot: 3
26
+ metrics:
27
+ - type: acc_norm
28
+ value: 77.19
29
+ name: normalized accuracy
30
+ source:
31
+ url: https://github.com/EQ-bench/EQ-Bench
32
+ name: EQ-Bench v2.1
33
+ - task:
34
+ type: text-generation
35
+ name: Text Generation
36
+ dataset:
37
+ name: AI2 Reasoning Challenge (25-Shot)
38
+ type: ai2_arc
39
+ config: ARC-Challenge
40
+ split: test
41
+ args:
42
+ num_few_shot: 25
43
+ metrics:
44
+ - type: acc_norm
45
+ value: 70.22
46
+ name: normalized accuracy
47
+ source:
48
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=giraffe176/WestMaid_HermesMonarchv0.1
49
+ name: Open LLM Leaderboard
50
+ - task:
51
+ type: text-generation
52
+ name: Text Generation
53
+ dataset:
54
+ name: HellaSwag (10-Shot)
55
+ type: hellaswag
56
+ split: validation
57
+ args:
58
+ num_few_shot: 10
59
+ metrics:
60
+ - type: acc_norm
61
+ value: 87.42
62
+ name: normalized accuracy
63
+ source:
64
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=giraffe176/WestMaid_HermesMonarchv0.1
65
+ name: Open LLM Leaderboard
66
+ - task:
67
+ type: text-generation
68
+ name: Text Generation
69
+ dataset:
70
+ name: MMLU (5-Shot)
71
+ type: cais/mmlu
72
+ config: all
73
+ split: test
74
+ args:
75
+ num_few_shot: 5
76
+ metrics:
77
+ - type: acc
78
+ value: 64.31
79
+ name: accuracy
80
+ source:
81
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=giraffe176/WestMaid_HermesMonarchv0.1
82
+ name: Open LLM Leaderboard
83
+ - task:
84
+ type: text-generation
85
+ name: Text Generation
86
+ dataset:
87
+ name: TruthfulQA (0-shot)
88
+ type: truthful_qa
89
+ config: multiple_choice
90
+ split: validation
91
+ args:
92
+ num_few_shot: 0
93
+ metrics:
94
+ - type: mc2
95
+ value: 61.99
96
+ source:
97
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=giraffe176/WestMaid_HermesMonarchv0.1
98
+ name: Open LLM Leaderboard
99
+ - task:
100
+ type: text-generation
101
+ name: Text Generation
102
+ dataset:
103
+ name: Winogrande (5-shot)
104
+ type: winogrande
105
+ config: winogrande_xl
106
+ split: validation
107
+ args:
108
+ num_few_shot: 5
109
+ metrics:
110
+ - type: acc
111
+ value: 82.16
112
+ name: accuracy
113
+ source:
114
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=giraffe176/WestMaid_HermesMonarchv0.1
115
+ name: Open LLM Leaderboard
116
+ - task:
117
+ type: text-generation
118
+ name: Text Generation
119
+ dataset:
120
+ name: GSM8k (5-shot)
121
+ type: gsm8k
122
+ config: main
123
+ split: test
124
+ args:
125
+ num_few_shot: 5
126
+ metrics:
127
+ - type: acc
128
+ value: 69.6
129
+ name: accuracy
130
+ source:
131
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=giraffe176/WestMaid_HermesMonarchv0.1
132
+ name: Open LLM Leaderboard
133
  ---
134
  # WestMaid_HermesMonarchv0.1
135