airesearch/WangchanLion7B

@@ -71,7 +71,7 @@ print(tokenizer.decode(output[0], skip_special_tokens=True))
 # Training Details
 ## Training Data
-Finetuning datasets are sourced from [LAION OIG chip2 and infill_dbpedia (Apache-2.0)](https://huggingface.co/datasets/laion/OIG), [DataBricks Dolly v2 (Apache-2.0)](https://github.com/databrickslabs/dolly), [OpenAI TL;DR (MIT)](https://github.com/openai/summarize-from-feedback), [Hello-SimpleAI HC3 (CC-BY SA)](https://huggingface.co/datasets/Hello-SimpleAI/HC3), [dolphin](https://huggingface.co/datasets/ehartford/dolphin), [iapp_wiki_qa_squad](https://huggingface.co/datasets/iapp_wiki_qa_squad) , [thaisum](https://huggingface.co/datasets/thaisum), [xlsum](https://huggingface.co/datasets/csebuetnlp/xlsum), [scb_mt_enth_2020](https://huggingface.co/datasets/scb_mt_enth_2020), han dataset, [xp3x](https://huggingface.co/datasets/Muennighoff/xP3x) and [Open-Platypus](https://huggingface.co/datasets/garage-bAInd/Open-Platypus).
 ## Training regime
 - QLoRA with 4 A100 (40GB)
@@ -81,17 +81,16 @@ We performed human and machine evaluations on XQuAD zero-shot and one-shot setti
 ## XQuAD
 |      Model     | Exact Match (Zero-shot) | F1 (Zero-shot) | Exact Match (One-shot) | F1 (One-shot) |
 |:--------------:|:-----------------------:|:--------------:|:----------------------:|:-------------:|
-| openthaigpt7B  |         18.5714         |     28.4002    |         30.4202        |    39.7556    |
-| SeaLLM7B       |            -            |        -       |            -           |     44.43     |
-| Typhoon-7b     |         23.8655         |      36.27     |       **46.7227**      |   **57.898**  |
-| WangchanLion7B |        **37.563**       |   **49.8432**  |         39.2437        |    51.0627    |
 ## iAPP Wiki QA
 |      Model     | Exact Match (Zero-shot) | F1 (Zero-shot) | Exact Match (One-shot) | F1 (One-shot) |
 |:--------------:|:-----------------------:|:--------------:|:----------------------:|:-------------:|
 | openthaigpt7B  |         22.0568         |     40.0696    |         31.3938        |    47.9775    |
 | SeaLLM7B       |          8.2544         |     34.4038    |         40.0541        |    58.2673    |
-| Typhoon-7b     |         27.3342         |     46.2938    |         43.3018        |    59.9434    |
 | WangchanLion7B |       **55.4804**       |   **67.9262**  |       **56.4276**      |  **68.8471**  |
 # What WangchanLion offers:

 # Training Details
 ## Training Data
+Finetuning datasets are sourced from [LAION OIG chip2 and infill_dbpedia (Apache-2.0)](https://huggingface.co/datasets/laion/OIG), [DataBricks Dolly v2 (Apache-2.0)](https://github.com/databrickslabs/dolly), [OpenAI TL;DR (MIT)](https://github.com/openai/summarize-from-feedback), [Hello-SimpleAI HC3 (CC-BY SA)](https://huggingface.co/datasets/Hello-SimpleAI/HC3), [dolphin](https://huggingface.co/datasets/ehartford/dolphin), [iapp_wiki_qa_squad](https://huggingface.co/datasets/iapp_wiki_qa_squad) , [thaisum](https://huggingface.co/datasets/thaisum), [xlsum](https://huggingface.co/datasets/csebuetnlp/xlsum), [scb_mt_enth_2020](https://huggingface.co/datasets/scb_mt_enth_2020), [han dataset](https://huggingface.co/datasets/pythainlp/han-instruct-dataset-v1.0), [xp3x](https://huggingface.co/datasets/Muennighoff/xP3x) and [Open-Platypus](https://huggingface.co/datasets/garage-bAInd/Open-Platypus).
 ## Training regime
 - QLoRA with 4 A100 (40GB)
 ## XQuAD
 |      Model     | Exact Match (Zero-shot) | F1 (Zero-shot) | Exact Match (One-shot) | F1 (One-shot) |
 |:--------------:|:-----------------------:|:--------------:|:----------------------:|:-------------:|
+| openthaigpt7B  |         18.57           |     28.40      |         30.42          |    39.76      |
+| SeaLLM7B       |            -            |        -       |            -           |    44.43      |
+| Typhoon-7b     |         -               |      -         |         34.46          |    **54.03**  |
+| WangchanLion7B |        **37.56**        |   **49.84**    |       **39.24**        |    51.06      |
 ## iAPP Wiki QA
 |      Model     | Exact Match (Zero-shot) | F1 (Zero-shot) | Exact Match (One-shot) | F1 (One-shot) |
 |:--------------:|:-----------------------:|:--------------:|:----------------------:|:-------------:|
 | openthaigpt7B  |         22.0568         |     40.0696    |         31.3938        |    47.9775    |
 | SeaLLM7B       |          8.2544         |     34.4038    |         40.0541        |    58.2673    |
 | WangchanLion7B |       **55.4804**       |   **67.9262**  |       **56.4276**      |  **68.8471**  |
 # What WangchanLion offers: