openchat
/

openchat-3.5-1210

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

imone commited on Dec 14, 2023

Commit

08168b1

•

1 Parent(s): 92e6458

Update README.md

Files changed (1) hide show

README.md +11 -7

README.md CHANGED Viewed

@@ -229,15 +229,11 @@ All models are evaluated in chat mode (e.g. with the respective conversation tem
 *: Grok results are reported by [X.AI](https://x.ai/).
-<div>
-<h3>Massive Multitask Language Understanding in Chinese (CMMLU)</h3>
-5-shot:
 </div>
-| Models   | STEM  | Humanities | SocialSciences | Other | ChinaSpecific | Avg   |
-|----------|-------|------------|----------------|-------|---------------|-------|
-| ChatGPT  | 47.81 | 55.68      | 56.5           | 62.66 | 50.69         | 55.51 |
-| OpenChat | 38.7  | 45.99      | 48.32          | 50.23 | 43.27         | 45.85 |
 <div>
 <h3>Multi-Level Multi-Discipline Chinese Evaluation Suite (CEVAL)</h3>
@@ -248,6 +244,14 @@ All models are evaluated in chat mode (e.g. with the respective conversation tem
 | ChatGPT  | 54.4  | 52.9  | 61.8           | 50.9       | 53.6   |
 | OpenChat | 47.29 | 45.22 | 52.49          | 48.52      | 45.08  |
 <div align="center">
 <h2> Limitations </h2>

 *: Grok results are reported by [X.AI](https://x.ai/).
+<div align="center">
+<h2> 中文评估结果 / Chinese Evaluations </h2>
 </div>
+⚠️ Note that this model was not explicitly trained in Chinese (only < 0.1% of the data is in Chinese). 请注意本模型没有针对性训练中文（中文数据占比小于0.1%）。
 <div>
 <h3>Multi-Level Multi-Discipline Chinese Evaluation Suite (CEVAL)</h3>
 | ChatGPT  | 54.4  | 52.9  | 61.8           | 50.9       | 53.6   |
 | OpenChat | 47.29 | 45.22 | 52.49          | 48.52      | 45.08  |
+<div>
+<h3>Massive Multitask Language Understanding in Chinese (CMMLU, 5-shot)</h3>
+</div>
+| Models   | STEM  | Humanities | SocialSciences | Other | ChinaSpecific | Avg   |
+|----------|-------|------------|----------------|-------|---------------|-------|
+| ChatGPT  | 47.81 | 55.68      | 56.5           | 62.66 | 50.69         | 55.51 |
+| OpenChat | 38.7  | 45.99      | 48.32          | 50.23 | 43.27         | 45.85 |
 <div align="center">
 <h2> Limitations </h2>