Update README.md
README.md
@@ -44,6 +44,17 @@ Thai language multiple choice exams, Test on unseen test sets, Zero-shot learnin
 
 (Updated on: 7 April 2024)
 
+## Benchmark on M3Exam evaluated by an external party (Float16.cloud)
+
+| **Models** | **ENGLISH (M3EXAM)** | **THAI (M3EXAM)** |
+|---------------------|------------------|---------------|
+| <b style="color:blue">OTG-7b</b> | <b style="color:blue">40.92 %</b> | <b style="color:blue">25.14 %</b> |
+| **OTG-13b** | 53.69 % | 36.49 % |
+| **OTG-70b** | <b>72.58 %</b> | <b>48.29 %</b> |
+| GPT-3.5-turbo-0613* | - | 34.1 % |
+| GPT-4-0613* | - | 56.0 % |
+More information: https://blog.float16.cloud/the-first-70b-thai-llm/
+
 ## Licenses
 **Source Code**: License Apache Software License 2.0.<br>
 **Weight**: Research and **Commercial uses**.<br>