jarodrigues commited on
Commit
098f75b
1 Parent(s): f0d3318

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -129,9 +129,9 @@ For testing, we reserved the translated datasets MRPC (similarity) and RTE (infe
129
 
130
  | Model | MRPC (F1) | RTE (F1) | COPA (F1) |
131
  |--------------------------|----------------|----------------|-----------|
132
- | **Gervásio 7B PTBR** | **0.7822** | **0.8321** | 0.2134 |
133
- | **LLaMA-2** | 0.0369 | 0.0516 | 0.4867 |
134
- | **LLaMA-2 Chat** | 0.5432 | 0.3807 | **0.5493**|
135
  <br>
136
 
137
  For further testing our decoder, in addition to the testing data described above, we also reused some of the datasets that had been resorted for American Portuguese to test the state-of-the-art Sabiá model and that were originally developed with materials from Portuguese: ASSIN2 RTE (entailment) and ASSIN2 STS (similarity), BLUEX (question answering), ENEM 2022 (question answering) and FaQuAD (extractive question-answering).
 
129
 
130
  | Model | MRPC (F1) | RTE (F1) | COPA (F1) |
131
  |--------------------------|----------------|----------------|-----------|
132
+ | **Gervásio 7B PTBR** | **0.7822** | **0.8321** | 0.2134 |
133
+ | **LLaMA-2 (EN)** | 0.0369 | 0.0516 | 0.4867 |
134
+ | **LLaMA-2 Chat (EN)** | 0.5432 | 0.3807 | **0.5493**|
135
  <br>
136
 
137
  For further testing our decoder, in addition to the testing data described above, we also reused some of the datasets that had been resorted for American Portuguese to test the state-of-the-art Sabiá model and that were originally developed with materials from Portuguese: ASSIN2 RTE (entailment) and ASSIN2 STS (similarity), BLUEX (question answering), ENEM 2022 (question answering) and FaQuAD (extractive question-answering).