1TuanPham commited on
Commit
38eb13d
1 Parent(s): 7ecf93d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -12
README.md CHANGED
@@ -377,6 +377,18 @@ A bad way to visualize i know...
377
 
378
  Our model currently sits at TOP-5 on the VMLU benchmark
379
 
 
 
 
 
 
 
 
 
 
 
 
 
380
  ## Citation
381
 
382
  <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
@@ -389,16 +401,4 @@ Our model currently sits at TOP-5 on the VMLU benchmark
389
  }
390
  ```
391
 
392
- # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
393
- Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_1TuanPham__T-Llama)
394
-
395
- | Metric |Value|
396
- |---------------------------------|----:|
397
- |Avg. |54.34|
398
- |AI2 Reasoning Challenge (25-Shot)|54.18|
399
- |HellaSwag (10-Shot) |76.48|
400
- |MMLU (5-Shot) |47.98|
401
- |TruthfulQA (0-shot) |46.47|
402
- |Winogrande (5-shot) |71.27|
403
- |GSM8k (5-shot) |29.64|
404
 
 
377
 
378
  Our model currently sits at TOP-5 on the VMLU benchmark
379
 
380
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
381
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_1TuanPham__T-Llama)
382
+ | Metric |Value|
383
+ |---------------------------------|----:|
384
+ |Avg. |54.34|
385
+ |AI2 Reasoning Challenge (25-Shot)|54.18|
386
+ |HellaSwag (10-Shot) |76.48|
387
+ |MMLU (5-Shot) |47.98|
388
+ |TruthfulQA (0-shot) |46.47|
389
+ |Winogrande (5-shot) |71.27|
390
+ |GSM8k (5-shot) |29.64|
391
+
392
  ## Citation
393
 
394
  <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
 
401
  }
402
  ```
403
 
 
 
 
 
 
 
 
 
 
 
 
 
404