What benchmark dataset is used for testing hallucination?
#2
by
zhiminy
- opened
hi, it's this: https://huggingface.co/spaces/vectara/Hallucination-evaluation-leaderboard
Thanks for your reply. Thus, it is indeed the CNN DM
dataset used for benchmarking the hallucination, right? Why not mention it somewhere in the documentation?