Commit
•
c5ed2f6
1
Parent(s):
750be3e
Update README.md
Browse files
README.md
CHANGED
@@ -25,7 +25,7 @@ The datasets contain a mix of hand-annotated and synthetic data. The maximum seq
|
|
25 |
- **Developed by:** Patronus AI
|
26 |
- **License:** [https://llama.meta.com/llama3/license](https://llama.meta.com/llama3/license)
|
27 |
|
28 |
-
### Model Sources
|
29 |
|
30 |
<!-- Provide the basic links for the model. -->
|
31 |
|
@@ -73,7 +73,7 @@ We train on 2400 samples consisting of CovidQA, PubmedQA, DROP and RAGTruth samp
|
|
73 |
|
74 |
## Evaluation
|
75 |
|
76 |
-
The model was evaluated on [PatronusAI/
|
77 |
|
78 |
It outperforms GPT-3.5-Turbo, GPT-4-Turbo, GPT-4o and Claude Sonnet.
|
79 |
|
|
|
25 |
- **Developed by:** Patronus AI
|
26 |
- **License:** [https://llama.meta.com/llama3/license](https://llama.meta.com/llama3/license)
|
27 |
|
28 |
+
### Model Sources
|
29 |
|
30 |
<!-- Provide the basic links for the model. -->
|
31 |
|
|
|
73 |
|
74 |
## Evaluation
|
75 |
|
76 |
+
The model was evaluated on [PatronusAI/HaluBench](https://huggingface.co/datasets/PatronusAI/HaluBench).
|
77 |
|
78 |
It outperforms GPT-3.5-Turbo, GPT-4-Turbo, GPT-4o and Claude Sonnet.
|
79 |
|