Update README.md
README.md
CHANGED
@@ -70,11 +70,12 @@ print(response)
 ## Benchmarks
 
 Nous Benchmark:
-
+
 | Model |AGIEval|GPT4All|TruthfulQA|Bigbench|Average|
 |---------------------------------------------------|------:|------:|---------:|-------:|------:|
 |[Master-Yi-9B](https://huggingface.co/qnguyen3/Master-Yi-9B)| 43.55| 71.48| 48.54| 41.43| 51.25|
 
+```
 ### AGIEval
 | Task |Version| Metric |Value| |Stderr|
 |------------------------------|------:|--------|----:|---|-----:|
@@ -152,11 +153,12 @@ Average score: 51.25%
 ```
 
 OpenLLM Benchmark:
-
+
 | Model |ARC |HellaSwag|MMLU |TruthfulQA|Winogrande|GSM8K|Average|
 |---------------------------------------------------|---:|--------:|----:|---------:|---------:|----:|------:|
 |[Master-Yi-9B](https://huggingface.co/qnguyen3/Master-Yi-9B)|61.6| 79.89|69.95| 48.59| 77.35|67.48| 67.48|
 
+```
 ### ARC
 | Task |Version| Metric | Value | |Stderr|
 |-------------|------:|--------------------|-------------|---|------|