kobkrit commited on
Commit
9712961
โ€ข
1 Parent(s): 1ca08c5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -5
README.md CHANGED
@@ -11,17 +11,18 @@ tags:
11
  ---
12
 
13
  # ๐Ÿ‡น๐Ÿ‡ญ OpenThaiGPT 7b 1.0.0
14
- <img src="https://1173516064-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FvvbWvIIe82Iv1yHaDBC5%2Fuploads%2Fb8eiMDaqiEQL6ahbAY0h%2Fimage.png?alt=media&token=6fce78fd-2cca-4c0a-9648-bd5518e644ce
15
- https://openthaigpt.aieat.or.th/" width="200px">
16
 
17
- ๐Ÿ‡น๐Ÿ‡ญ OpenThaiGPT 7b Version 1.0.0-beta is a Thai language 7B-parameter LLaMA v2 Chat model finetuned to Thai instructions and extend more than 10,000 most popular Thai words vocabularies into LLM's dictionary for turbo speed.
18
 
19
  ## Features
20
- - State-of-the-Art Thai language LLM, Acheive the highest average score over all Thai opensource LLMs on 9 Thai language exams.
21
  - Multi-turn Conversation Support
22
  - Retrieval Augmented Generation (RAG) Support
23
 
24
- ## Benchmark
 
25
  | **Exams** | **OTG 7b (Aug 2023)** | **OTG 13b (Dec 2023)** | **OTG 7b (March 2024)** | **OTG 13b (March 2024)** | **OTG 70b (March 2024)** | **SeaLLM 7b v1** | **SeaLLM 7b v2** | **TyphoonGPT 7b** | **SeaLion 7b** | **WanchanGLM 7b** | **Sailor-7B-Chat** | **GPT3.5** | **GPT4** | **Gemini Pro** | **Gemini 1.5** | **Claude 3 Haiku** | **Claude 3 Sonnet** | **Claude 3 Opus** |
26
  |----------------------------------|-----------------------|------------------------|-------------------------|--------------------------|--------------------------|------------------|------------------|--------------------|----------------|-------------------|--------------------|------------|----------|----------------|----------------|--------------------|---------------------|-------------------|
27
  | **A-Level** | 17.50% | 34.17% | 25.00% | 30.83% | 45.83% | 18.33% | 34.17% | N/A | 21.67% | 17.50% | 40.00% | 38.33% | 65.83% | 56.67% | 55.83% | 58.33% | 59.17% | 77.50% |
@@ -35,6 +36,12 @@ https://openthaigpt.aieat.or.th/" width="200px">
35
  | **ONET M6** | 21.14% | 28.87% | 22.53% | 23.32% | 42.85% | 15.09% | 19.48% | N/A | 16.96% | 20.67% | 28.64% | 34.44% | 46.29% | 45.53% | 50.23% | 34.79% | 38.49% | 48.56% |
36
  | **Average Score** | 23.83% | 37.27% | 38.40% | 40.33% | 55.87% | 18.06% | 33.56% | N/A | 27.44% | 23.75% | 37.28% | 43.07% | 60.68% | 52.30% | 52.89% | 50.65% | 56.81% | 68.32% |
37
 
 
 
 
 
 
 
38
  ## Licenses
39
  **Source Code**: License Apache Software License 2.0.<br>
40
  **Weight**: Research and **Commercial uses**.<br>
 
11
  ---
12
 
13
  # ๐Ÿ‡น๐Ÿ‡ญ OpenThaiGPT 7b 1.0.0
14
+ ![OpenThaiGPT](https://1173516064-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FvvbWvIIe82Iv1yHaDBC5%2Fuploads%2Fb8eiMDaqiEQL6ahbAY0h%2Fimage.png?alt=media&token=6fce78fd-2cca-4c0a-9648-bd5518e644ce)
15
+ [More Info](https://openthaigpt.aieat.or.th/)
16
 
17
+ ๐Ÿ‡น๐Ÿ‡ญ OpenThaiGPT 7b Version 1.0.0-beta is a Thai language 7B-parameter LLaMA v2 Chat model finetuned to Thai instructions and extended with more than 10,000 most popular Thai words vocabularies into the LLM's dictionary for turbo speed.
18
 
19
  ## Features
20
+ - State-of-the-Art Thai language LLM, achieving the highest average score over all Thai opensource LLMs on 9 Thai language exams.
21
  - Multi-turn Conversation Support
22
  - Retrieval Augmented Generation (RAG) Support
23
 
24
+
25
+ ## Benchmark by Multiple Choices Exams
26
  | **Exams** | **OTG 7b (Aug 2023)** | **OTG 13b (Dec 2023)** | **OTG 7b (March 2024)** | **OTG 13b (March 2024)** | **OTG 70b (March 2024)** | **SeaLLM 7b v1** | **SeaLLM 7b v2** | **TyphoonGPT 7b** | **SeaLion 7b** | **WanchanGLM 7b** | **Sailor-7B-Chat** | **GPT3.5** | **GPT4** | **Gemini Pro** | **Gemini 1.5** | **Claude 3 Haiku** | **Claude 3 Sonnet** | **Claude 3 Opus** |
27
  |----------------------------------|-----------------------|------------------------|-------------------------|--------------------------|--------------------------|------------------|------------------|--------------------|----------------|-------------------|--------------------|------------|----------|----------------|----------------|--------------------|---------------------|-------------------|
28
  | **A-Level** | 17.50% | 34.17% | 25.00% | 30.83% | 45.83% | 18.33% | 34.17% | N/A | 21.67% | 17.50% | 40.00% | 38.33% | 65.83% | 56.67% | 55.83% | 58.33% | 59.17% | 77.50% |
 
36
  | **ONET M6** | 21.14% | 28.87% | 22.53% | 23.32% | 42.85% | 15.09% | 19.48% | N/A | 16.96% | 20.67% | 28.64% | 34.44% | 46.29% | 45.53% | 50.23% | 34.79% | 38.49% | 48.56% |
37
  | **Average Score** | 23.83% | 37.27% | 38.40% | 40.33% | 55.87% | 18.06% | 33.56% | N/A | 27.44% | 23.75% | 37.28% | 43.07% | 60.68% | 52.30% | 52.89% | 50.65% | 56.81% | 68.32% |
38
 
39
+ ### Benchmark Configuration
40
+ - Clearly instruct model to answer by select one of a possible choice and followed by an explanation.
41
+ - Zero shot only
42
+ - Tested on unseen test set only
43
+ - Detect a multi-choice answer on (A),(B),(C),(D),(E) at the beginning of the answer (First priority) and at the end of the answer (Second priority)
44
+
45
  ## Licenses
46
  **Source Code**: License Apache Software License 2.0.<br>
47
  **Weight**: Research and **Commercial uses**.<br>