Adding the Open Portuguese LLM Leaderboard Evaluation Results

This is an automated PR created with https://huggingface.co/spaces/eduagarcia-temp/portuguese-leaderboard-results-to-modelcard

The purpose of this PR is to add evaluation results from the [🚀 Open Portuguese LLM Leaderboard](https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard) to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/eduagarcia-temp/portuguese-leaderboard-results-to-modelcard/discussions

Files changed (1)

README.md +23 -4

@@ -1,14 +1,14 @@
 ---
-base_model: google/gemma-2-9b-it
+license: mit
 tags:
 - alignment-handbook
 - generated_from_trainer
+base_model: google/gemma-2-9b-it
 datasets:
 - princeton-nlp/gemma2-ultrafeedback-armorm
 model-index:
-- name: princeton-nlp/gemma-2-9b-it-SimPO
+- name: princeton-nlp/gemma-2-9b-it-SimPO
   results: []
-license: mit
 ---
 # gemma-2-9b-it-SimPO Model Card
@@ -135,4 +135,23 @@ ArmoRM paper:
   journal={arXiv preprint arXiv:2406.12845},
   year={2024}
 }
-```
+```
+# Open Portuguese LLM Leaderboard Evaluation Results
+Detailed results can be found [here](https://huggingface.co/datasets/eduagarcia-temp/llm_pt_leaderboard_raw_results/tree/main/princeton-nlp/gemma-2-9b-it-SimPO) and on the [🚀 Open Portuguese LLM Leaderboard](https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard)
+|          Metric          |  Value  |
+|--------------------------|---------|
+|Average                   |**73.28**|
+|ENEM Challenge (No Images)|    75.09|
+|BLUEX (No Images)         |    65.37|
+|OAB Exams                 |    54.21|
+|Assin2 RTE                |    93.82|
+|Assin2 STS                |    77.82|
+|FaQuAD NLI                |    70.45|
+|HateBR Binary             |    89.76|
+|PT Hate Speech Binary     |    66.68|
+|tweetSentBR               |    66.28|
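
As a quick sanity check on the table this PR adds, the reported Average appears to be the unweighted mean of the nine task scores. The sketch below assumes that aggregation (the PR itself does not state how the average is computed); `SCORES` and `mean_score` are illustrative names, not part of any leaderboard tooling.

```python
# Task scores copied from the table added to README.md in this PR.
SCORES = {
    "ENEM Challenge (No Images)": 75.09,
    "BLUEX (No Images)": 65.37,
    "OAB Exams": 54.21,
    "Assin2 RTE": 93.82,
    "Assin2 STS": 77.82,
    "FaQuAD NLI": 70.45,
    "HateBR Binary": 89.76,
    "PT Hate Speech Binary": 66.68,
    "tweetSentBR": 66.28,
}


def mean_score(scores: dict[str, float]) -> float:
    """Return the unweighted mean of the task scores (assumed aggregation)."""
    return sum(scores.values()) / len(scores)


if __name__ == "__main__":
    # Rounded to two decimals, as in the table.
    print(f"{mean_score(SCORES):.2f}")  # prints 73.28
```

The result matches the **73.28** in the Average row, which is consistent with a simple mean over the nine tasks.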