kyujinpy
/

Ko-PlatYi-6B-gu

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

kyujinpy commited on Dec 4, 2023

Commit

bc972ad

•

1 Parent(s): 2ee3cc2

Upload README.md

Files changed (1) hide show

README.md +19 -3

README.md CHANGED Viewed

@@ -34,15 +34,31 @@ Github: [Coming soon...]
 # **Model Benchmark**
-## Open leaderboard
-- Follow up as [link](https://huggingface.co/spaces/upstage/open-ko-llm-leaderboard).
 | Model | Average | ARC | HellaSwag | MMLU | TruthfulQA | CommonGen-V2 |
 | --- | --- | --- | --- | --- | --- | --- |
 | **Ko-PlatYi-6B-gu** | NaN | NaN | NaN | NaN | NaN | NaN |
 | Ko-PlatYi-6B | NaN | NaN | NaN | NaN | NaN | NaN |
 | Yi-Ko-6B | 48.79 | 41.04 | 53.39 | 46.28 | 41.64 | 61.63 |
 # Implementation Code
 ```python
 ### KO-Platypus

 # **Model Benchmark**
+## Open leaderboard
+> Follow up as [link](https://huggingface.co/spaces/upstage/open-ko-llm-leaderboard).
 | Model | Average | ARC | HellaSwag | MMLU | TruthfulQA | CommonGen-V2 |
 | --- | --- | --- | --- | --- | --- | --- |
+| Ko-PlatYi-6B-O | NaN | NaN | NaN | NaN | NaN | NaN |
+| Ko-PlatYi-6B-kiwi | NaN | NaN | NaN | NaN | NaN | NaN |
 | **Ko-PlatYi-6B-gu** | NaN | NaN | NaN | NaN | NaN | NaN |
 | Ko-PlatYi-6B | NaN | NaN | NaN | NaN | NaN | NaN |
 | Yi-Ko-6B | 48.79 | 41.04 | 53.39 | 46.28 | 41.64 | 61.63 |
+---
+## AI-Harness Evaluation
+> AI-Harness evaluation; [link](https://github.com/Beomi/ko-lm-evaluation-harness)
+| Model | BoolQ | Copa | HellaSwag | Sentineg |
+| --- | --- | --- | --- | --- |
+|  | *Zero-shot* ||||
+| Ko-PlatYi-6B-O | 0.3343 | 0.7687 | 0.4833 | 0.5794 |
+| Ko-PlatYi-6B-kiwi | 0.3343 | 0.7665 | 0.4746 | **0.6248** |
+| **Ko-PlatYi-6B-gu** | **0.7077** | **0.7696** | 0.4797 | 0.3979 |
+| Ko-PlatYi-6B | 0.3343 | 0.7684 | **0.4917** | 0.5226 |
+| Yi-Ko-6B | **0.7070** | 0.7696 | **0.5009** | 0.4044 |
+---
 # Implementation Code
 ```python
 ### KO-Platypus