Update README.md
Browse files
README.md
CHANGED
@@ -60,12 +60,16 @@ The model is based on [`rinna/bilingual-gpt-neox-4b`](https://huggingface.co/rin
|
|
60 |
|
61 |
[Tianyu Zhao](https://huggingface.co/tianyuz) and [Kei Sawada](https://huggingface.co/keisawada)
|
62 |
|
|
|
|
|
|
|
|
|
63 |
---
|
64 |
|
65 |
# Benchmarking
|
66 |
|
67 |
-
Our evaluation experiments suggest that the bilingual-gpt-neox-4b-instruction-sft model performs slightly better than the previous [Japanese GPT-NeoX 3.6B PPO](https://huggingface.co/rinna/japanese-gpt-neox-3.6b-instruction-ppo) in Japanese tasks.
|
68 |
-
|
69 |
- *The 4-task average accuracy is based on results of JCommonsenseQA, JNLI, MARC-ja, and JSQuAD.*
|
70 |
- *The 6-task average accuracy is based on results of JCommonsenseQA, JNLI, MARC-ja, JSQuAD, XWinograd, and JAQKET-v2.*
|
71 |
|
|
|
60 |
|
61 |
[Tianyu Zhao](https://huggingface.co/tianyuz) and [Kei Sawada](https://huggingface.co/keisawada)
|
62 |
|
63 |
+
* **Release date**
|
64 |
+
|
65 |
+
August 2, 2023
|
66 |
+
|
67 |
---
|
68 |
|
69 |
# Benchmarking
|
70 |
|
71 |
+
Our evaluation experiments suggest that the bilingual-gpt-neox-4b-instruction-sft model performs slightly better than the previous [Japanese GPT-NeoX 3.6B PPO](https://huggingface.co/rinna/japanese-gpt-neox-3.6b-instruction-ppo) in Japanese tasks.
|
72 |
+
|
73 |
- *The 4-task average accuracy is based on results of JCommonsenseQA, JNLI, MARC-ja, and JSQuAD.*
|
74 |
- *The 6-task average accuracy is based on results of JCommonsenseQA, JNLI, MARC-ja, JSQuAD, XWinograd, and JAQKET-v2.*
|
75 |
|