Commit 6edba2d
Parent(s): 860a6f7
Update README.md
README.md
CHANGED
@@ -36,10 +36,16 @@ This is the model card of a 🤗 transformers model that has been pushed on the
 - **License:** cc-by-nc-sa-4.0
 - **Finetuned from model:** [Minerva-3B-base-v1.0](https://huggingface.co/sapienzanlp/Minerva-3B-base-v1.0), developed by [Sapienza NLP](https://nlp.uniroma1.it) in collaboration with [Future Artificial Intelligence Research (FAIR)](https://fondazione-fair.it/) and [CINECA](https://www.cineca.it/)
 
-
-
-
-<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+## Evaluation
+
+For a detailed comparison of model performance, check out the [Leaderboard for Italian Language Models](https://huggingface.co/spaces/FinancialSupport/open_ita_llm_leaderboard).
+
+Here's a breakdown of the performance metrics:
+| Metric | hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
+|:----------------------------|:----------------------|:----------------|:---------------------|:--------|
+| **Minerva-3B-Instruct-v1.0** | 0.5191 | 0.3174 | 0.2629 | 0.3664 |
+| Minerva-3B-base-v1.0 | 0.5187 | 0.3045 | 0.2612 | 0.3614 |
+
 ### Sample Code
 
 ```python