RebeccaQian1 commited on
Commit
c5ed2f6
1 Parent(s): 750be3e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -25,7 +25,7 @@ The datasets contain a mix of hand-annotated and synthetic data. The maximum seq
25
  - **Developed by:** Patronus AI
26
  - **License:** [https://llama.meta.com/llama3/license](https://llama.meta.com/llama3/license)
27
 
28
- ### Model Sources [optional]
29
 
30
  <!-- Provide the basic links for the model. -->
31
 
@@ -73,7 +73,7 @@ We train on 2400 samples consisting of CovidQA, PubmedQA, DROP and RAGTruth samp
73
 
74
  ## Evaluation
75
 
76
- The model was evaluated on [PatronusAI/hallucination-evaluation-benchmark](https://huggingface.co/datasets/PatronusAI/hallucination-evaluation-benchmark).
77
 
78
  It outperforms GPT-3.5-Turbo, GPT-4-Turbo, GPT-4o and Claude Sonnet.
79
 
 
25
  - **Developed by:** Patronus AI
26
  - **License:** [https://llama.meta.com/llama3/license](https://llama.meta.com/llama3/license)
27
 
28
+ ### Model Sources
29
 
30
  <!-- Provide the basic links for the model. -->
31
 
 
73
 
74
  ## Evaluation
75
 
76
+ The model was evaluated on [PatronusAI/HaluBench](https://huggingface.co/datasets/PatronusAI/HaluBench).
77
 
78
  It outperforms GPT-3.5-Turbo, GPT-4-Turbo, GPT-4o and Claude Sonnet.
79