upskyy committed on
Commit
4096d3b
1 Parent(s): d052404

Upload README.md with huggingface_hub

  1. README.md +8 -1
README.md CHANGED
@@ -46,7 +46,14 @@ state of the art AI models and helping foster innovation for everyone.
 
  ## Benchmark Scores
 
- TBD
+ We evaluated this model internally using the [LogicKor](https://github.com/instructkr/LogicKor) code. The public LogicKor code uses GPT-4 as the judge, whereas our internal evaluation used GPT-4o. Public scores will be added once they are released. The scores below cover 0-shot evaluation only.
+
+ | Model | Math | Reasoning | Writing | Coding | Understanding | Grammar | Single ALL | Multi ALL | Overall |
+ |:---------:|:-----:|:------:|:-----:|:-----:|:----:|:-----:|:-----:|:-----:|:----:|
+ | [rtzr/ko-gemma-2-9b-it](https://huggingface.co/rtzr/ko-gemma-2-9b-it) | 8.14 / 7.29 | 8.86 / 8.14 | 8.14 / 9.00 | 8.57 / 9.29 | 8.86 / 9.43 | 4.86 / 4.29 | 7.90 | 7.90 | 7.90 |
+ | [MLP-KTLim/llama-3-Korean-Bllossom-8B](https://huggingface.co/MLP-KTLim/llama-3-Korean-Bllossom-8B) | 6.43 / 5.71 | 6.86 / 5.14 | 9.14 / 8.57 | 8.29 / 8.14 | 8.43 / 9.29 | 5.71 / 5.29 | 7.48 | 7.02 | 7.25 |
+ | [yanolja/EEVE-Korean-Instruct-10.8B-v1.0](https://huggingface.co/yanolja/EEVE-Korean-Instruct-10.8B-v1.0) | 5.57 / 4.29 | 8.14 / 5.14 | 8.29 / 6.29 | 6.43 / 7.86 | 9.29 / 8.57 | 6.57 / 3.71 | 7.38 | 5.98 | 6.68 |
+ | [allganize/Llama-3-Alpha-Ko-8B-Instruct](https://huggingface.co/allganize/Llama-3-Alpha-Ko-8B-Instruct) | 4.57 / 3.00 | 6.86 / 6.43 | 7.43 / 6.71 | 8.43 / 8.43 | 7.71 / 8.71 | 6.71 / 4.43 | 6.95 | 6.29 | 6.62 |
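
The table does not say how the three aggregate columns are derived. The reported numbers are consistent with reading each category cell as `single-turn / multi-turn`, taking `Single ALL` and `Multi ALL` as the mean of the six single-turn and six multi-turn scores, and `Overall` as the mean of those two. The sketch below checks that reading against the rows above. It is an inference from the numbers, not a description of the LogicKor scoring code, and the names `SCORES`, `REPORTED`, and `aggregate` are hypothetical.

```python
from statistics import mean

# (single-turn, multi-turn) per category, in table order:
# Math, Reasoning, Writing, Coding, Understanding, Grammar.
SCORES = {
    "rtzr/ko-gemma-2-9b-it": [
        (8.14, 7.29), (8.86, 8.14), (8.14, 9.00),
        (8.57, 9.29), (8.86, 9.43), (4.86, 4.29),
    ],
    "MLP-KTLim/llama-3-Korean-Bllossom-8B": [
        (6.43, 5.71), (6.86, 5.14), (9.14, 8.57),
        (8.29, 8.14), (8.43, 9.29), (5.71, 5.29),
    ],
    "yanolja/EEVE-Korean-Instruct-10.8B-v1.0": [
        (5.57, 4.29), (8.14, 5.14), (8.29, 6.29),
        (6.43, 7.86), (9.29, 8.57), (6.57, 3.71),
    ],
    "allganize/Llama-3-Alpha-Ko-8B-Instruct": [
        (4.57, 3.00), (6.86, 6.43), (7.43, 6.71),
        (8.43, 8.43), (7.71, 8.71), (6.71, 4.43),
    ],
}

# (Single ALL, Multi ALL, Overall) as reported in the table.
REPORTED = {
    "rtzr/ko-gemma-2-9b-it": (7.90, 7.90, 7.90),
    "MLP-KTLim/llama-3-Korean-Bllossom-8B": (7.48, 7.02, 7.25),
    "yanolja/EEVE-Korean-Instruct-10.8B-v1.0": (7.38, 5.98, 6.68),
    "allganize/Llama-3-Alpha-Ko-8B-Instruct": (6.95, 6.29, 6.62),
}


def aggregate(pairs):
    """Assumed aggregation: per-turn means, then the mean of the two."""
    single_all = mean(single for single, _ in pairs)
    multi_all = mean(multi for _, multi in pairs)
    overall = (single_all + multi_all) / 2
    return single_all, multi_all, overall


if __name__ == "__main__":
    for model, pairs in SCORES.items():
        computed = aggregate(pairs)
        # Allow 0.01 of slack, since the table values are already rounded.
        ok = all(abs(c - r) <= 0.01 for c, r in zip(computed, REPORTED[model]))
        print(model, [f"{c:.3f}" for c in computed], "matches" if ok else "differs")
```

The comparison uses a tolerance rather than exact equality because the inputs are two-decimal values from the table, so the recomputed means can differ from the reported ones in the last digit.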
 
  ## Usage