yentinglin
commited on
Commit
•
eb42e4e
1
Parent(s):
67b3552
Update README.md
Browse files
README.md
CHANGED
@@ -48,9 +48,6 @@ Llama-3-Taiwan-70B is a large language model finetuned for Traditional Mandarin
|
|
48 |
|
49 |
# Evaluation
|
50 |
|
51 |
-
![image/png](https://cdn-uploads.huggingface.co/production/uploads/5df9c78eda6d0311fd3d541f/rxf8ICdP_geS6Gc5cCazh.png)
|
52 |
-
|
53 |
-
|
54 |
Checkout [Open TW LLM Leaderboard](https://huggingface.co/spaces/yentinglin/open-tw-llm-leaderboard) for full and updated list.
|
55 |
|
56 |
| Model | [TMLU](https://arxiv.org/pdf/2403.20180) | Taiwan Truthful QA | [Legal Eval](https://huggingface.co/datasets/lianghsun/tw-legal-benchmark-v1) | [TW MT-Bench](https://huggingface.co/datasets/MediaTek-Research/TCEval-v2) | Long context | Function Calling | [TMMLU+](https://github.com/iKala/ievals) |
|
|
|
48 |
|
49 |
# Evaluation
|
50 |
|
|
|
|
|
|
|
51 |
Checkout [Open TW LLM Leaderboard](https://huggingface.co/spaces/yentinglin/open-tw-llm-leaderboard) for full and updated list.
|
52 |
|
53 |
| Model | [TMLU](https://arxiv.org/pdf/2403.20180) | Taiwan Truthful QA | [Legal Eval](https://huggingface.co/datasets/lianghsun/tw-legal-benchmark-v1) | [TW MT-Bench](https://huggingface.co/datasets/MediaTek-Research/TCEval-v2) | Long context | Function Calling | [TMMLU+](https://github.com/iKala/ievals) |
|