jaspercatapang leaderboard-pr-bot commited on
Commit
0d290a6
1 Parent(s): 1b13331

Adding Evaluation Results (#2)

Browse files

- Adding Evaluation Results (7e51734846d69f9a6dc88f895271226c292c572a)


Co-authored-by: Open LLM Leaderboard PR Bot <leaderboard-pr-bot@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +14 -1
README.md CHANGED
@@ -72,4 +72,17 @@ For additional information or inquiries about FinOPT-Franklin, please contact th
72
  FinOPT-Franklin is an AI language model trained by Maya Philippines. It is provided "as is" without warranty of any kind, express or implied. The model developers and Maya Philippines shall not be liable for any direct or indirect damages arising from the use of this model.
73
 
74
  ## Acknowledgments
75
- The development of FinOPT-Franklin was made possible by Maya Philippines and the curation and creation of the financial question-answering dataset.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
72
  FinOPT-Franklin is an AI language model trained by Maya Philippines. It is provided "as is" without warranty of any kind, express or implied. The model developers and Maya Philippines shall not be liable for any direct or indirect damages arising from the use of this model.
73
 
74
  ## Acknowledgments
75
+ The development of FinOPT-Franklin was made possible by Maya Philippines and the curation and creation of the financial question-answering dataset.
76
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
77
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MayaPH__FinOPT-Franklin)
78
+
79
+ | Metric | Value |
80
+ |-----------------------|---------------------------|
81
+ | Avg. | 25.54 |
82
+ | ARC (25-shot) | 27.73 |
83
+ | HellaSwag (10-shot) | 24.91 |
84
+ | MMLU (5-shot) | 23.12 |
85
+ | TruthfulQA (0-shot) | 52.4 |
86
+ | Winogrande (5-shot) | 50.51 |
87
+ | GSM8K (5-shot) | 0.0 |
88
+ | DROP (3-shot) | 0.1 |