Forrest Bao
commited on
Commit
•
9c966da
1
Parent(s):
402fb1d
fix typos in performance numbers
Browse files
README.md
CHANGED
@@ -68,10 +68,10 @@ The tables below compare the two models on the [AggreFact](https://arxiv.org/pdf
|
|
68 |
Table 1: Performance on AggreFact-SOTA
|
69 |
| model | Balanced Accuracy | F1 | Recall | Precision |
|
70 |
|:------------------------|---------:|-------:|-------:|----------:|
|
71 |
-
| HHEM-1.0 | 78.87% | 90.47% | 70.81% | 67.
|
72 |
| HHEM-2.1-Open | 76.55% | 66.77% | 68.48% | 65.13% |
|
73 |
-
| GPT-3.5-Turbo zero-shot | 72.19% | 60.88% | 58.48% | 63.
|
74 |
-
| GPT-4 06-13 zero-shot | 73.78% | 63.
|
75 |
|
76 |
Table 2: Performance on RAGTruth-Summ
|
77 |
| model | Balanced Accuracy | F1 | Recall | Precision |
|
|
|
68 |
Table 1: Performance on AggreFact-SOTA
|
69 |
| model | Balanced Accuracy | F1 | Recall | Precision |
|
70 |
|:------------------------|---------:|-------:|-------:|----------:|
|
71 |
+
| HHEM-1.0 | 78.87% | 90.47% | 70.81% | 67.27% |
|
72 |
| HHEM-2.1-Open | 76.55% | 66.77% | 68.48% | 65.13% |
|
73 |
+
| GPT-3.5-Turbo zero-shot | 72.19% | 60.88% | 58.48% | 63.49% |
|
74 |
+
| GPT-4 06-13 zero-shot | 73.78% | 63.87% | 53.03% | 80.28% |
|
75 |
|
76 |
Table 2: Performance on RAGTruth-Summ
|
77 |
| model | Balanced Accuracy | F1 | Recall | Precision |
|