RebeccaQian1 commited on
Commit
0d68660
1 Parent(s): d88ac0a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -73,7 +73,7 @@ We train on 2400 samples consisting of CovidQA, PubmedQA, DROP and RAGTruth samp
73
 
74
  ## Evaluation
75
 
76
- The model was evaluated on [PatronusAI/hallucination-evaluation-benchmark](https://huggingface.co/datasets/PatronusAI/hallucination-evaluation-benchmark).
77
 
78
  It outperforms GPT-3.5-Turbo, GPT-4-Turbo, GPT-4o and Claude Sonnet.
79
 
 
73
 
74
  ## Evaluation
75
 
76
+ The model was evaluated on [PatronusAI/halubench](https://huggingface.co/datasets/PatronusAI/halubench).
77
 
78
  It outperforms GPT-3.5-Turbo, GPT-4-Turbo, GPT-4o and Claude Sonnet.
79