Spaces:
Running
on
CPU Upgrade
Running
on
CPU Upgrade
How are Faithfulness and Factuality calculated?
2
#22 opened about 1 month ago
by
UjjwalP
How could #parameter of a model be 0?
2
#20 opened about 2 months ago
by
zhiminy
Why is the score for RACE so low?
1
#18 opened about 2 months ago
by
thangphan68
Adding German Faithfulness Detection Task
1
#16 opened 3 months ago
by
mtc
Adding SummEdits to leaderboard?
1
#12 opened 3 months ago
by
philippelaban
Adding tasks from the USB benchmark (for summarization)
1
#11 opened 3 months ago
by
kundank
Adding the Snowball Hallucination detection datasets
#9 opened 3 months ago
by
ofirpress
Longform QA
2
#8 opened 3 months ago
by
shehzaadzd
Metrics for hallucination detection for summarization.
4
#6 opened 4 months ago
by
rohitsaxena
Hello all!
#5 opened 4 months ago
by
pminervini