openchat
/

openchat-3.5-1210

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

alpayariyak commited on Dec 13, 2023

Commit

f592fa6

•

1 Parent(s): b93ddae

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -240,6 +240,15 @@ All models are evaluated in chat mode (e.g. with the respective conversation tem
 | ChatGPT  | 47.81 | 55.68      | 56.5           | 62.66 | 50.69         | 55.51 |
 | OpenChat | 38.7  | 45.99      | 48.32          | 50.23 | 43.27         | 45.85 |
 <div align="center">
 <h2> Limitations </h2>

 | ChatGPT  | 47.81 | 55.68      | 56.5           | 62.66 | 50.69         | 55.51 |
 | OpenChat | 38.7  | 45.99      | 48.32          | 50.23 | 43.27         | 45.85 |
+<div>
+<h3>Multi-Level Multi-Discipline Chinese Evaluation Suite (CEVAL)</h3>
+<div>
+| Model    | Avg   | STEM  | Social Science | Humanities | Others |
+|----------|-------|-------|----------------|------------|--------|
+| ChatGPT  | 54.4  | 52.9  | 61.8           | 50.9       | 53.6   |
+| OpenChat | 47.29 | 45.22 | 52.49          | 48.52      | 45.08  |
 <div align="center">
 <h2> Limitations </h2>