Walmart-the-bag commited on
Commit
b23076b
β€’
1 Parent(s): 37ce1b3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -23
README.md CHANGED
@@ -111,16 +111,19 @@ model-index:
111
 
112
  This model is a fine-tuned version of [01-ai/Yi-6B-Chat](https://huggingface.co/01-ai/Yi-6B-Chat) on the alpaca_gpt4_en dataset.
113
 
114
- EVAL:
115
- | T | Model | Average ⬆️ | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
116
- |---|-------|------------|-----|-----------|------|------------|------------|-------|
117
- | πŸ”Ά | Walmart-the-bag/Yi-6B-Infinity-Chat πŸ“‘ | 59.83 | 56.57 | 77.66 | 64.05 | 50.75 | 73.95 | 36.01 |
118
- | πŸ”Ά | HenryJJ/Instruct_Yi-6B_Dolly15K πŸ“‘ | 56.85 | 54.86 | 75.87 | 63.37 | 42.84 | 74.9 | 29.26 |
119
- | 🟒 | 01-ai/Yi-6B-200K πŸ“‘ | 56.76 | 53.75 | 75.57 | 64.65 | 41.56 | 73.64 | 31.39 |
120
- | 🟒 | 01-ai/Yi-6B-200K πŸ“‘ | 56.69 | 53.58 | 75.58 | 64.65 | 41.74 | 74.27 | 30.33 |
121
- | πŸ”Ά | HenryJJ/Instruct_Yi-6B_Dolly_CodeAlpaca πŸ“‘ | 56.11 | 53.16 | 75.3 | 63.06 | 41.42 | 75.37 | 28.35 |
122
- | πŸ”Ά | adamo1139/Yi-6B-200K-AEZAKMI-v2 πŸ“‘ | 54.93 | 52.99 | 71.2 | 63 | 46.79 | 70.48 | 25.09 |
123
- | 🟒 | 01-ai/Yi-6B πŸ“‘ | 54.08 | 55.55 | 76.57 | 64.11 | 41.96 | 74.19 | 12.13 |
 
 
 
124
 
125
 
126
  # ORIGINAL MODEL CARD:
@@ -735,16 +738,3 @@ via applications. All usage must adhere to the [Yi Series Models Community Licen
735
  For free commercial use, you only need to send an email to [get official commercial permission](https://www.lingyiwanwu.com/yi-license).
736
 
737
  <div align="right"> [ <a href="#building-the-next-generation-of-open-source-and-bilingual-llms">Back to top ⬆️ </a> ] </div>
738
- # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
739
- Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Walmart-the-bag__Yi-6B-Infinity-Chat)
740
-
741
- | Metric |Value|
742
- |---------------------------------|----:|
743
- |Avg. |59.83|
744
- |AI2 Reasoning Challenge (25-Shot)|56.57|
745
- |HellaSwag (10-Shot) |77.66|
746
- |MMLU (5-Shot) |64.05|
747
- |TruthfulQA (0-shot) |50.75|
748
- |Winogrande (5-shot) |73.95|
749
- |GSM8k (5-shot) |36.01|
750
-
 
111
 
112
  This model is a fine-tuned version of [01-ai/Yi-6B-Chat](https://huggingface.co/01-ai/Yi-6B-Chat) on the alpaca_gpt4_en dataset.
113
 
114
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
115
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Walmart-the-bag__Yi-6B-Infinity-Chat)
116
+
117
+ | Metric |Value|
118
+ |---------------------------------|----:|
119
+ |Avg. |59.83|
120
+ |AI2 Reasoning Challenge (25-Shot)|56.57|
121
+ |HellaSwag (10-Shot) |77.66|
122
+ |MMLU (5-Shot) |64.05|
123
+ |TruthfulQA (0-shot) |50.75|
124
+ |Winogrande (5-shot) |73.95|
125
+ |GSM8k (5-shot) |36.01|
126
+
127
 
128
 
129
  # ORIGINAL MODEL CARD:
 
738
  For free commercial use, you only need to send an email to [get official commercial permission](https://www.lingyiwanwu.com/yi-license).
739
 
740
  <div align="right"> [ <a href="#building-the-next-generation-of-open-source-and-bilingual-llms">Back to top ⬆️ </a> ] </div>