New discussion

flan-t5 evals failing

#898 opened about 14 hours ago by pszemraj

phi-3-small-128k MATH Lvl 5 is 0

#897 opened about 19 hours ago by huu-ontocord

model evaluation failed

2
#895 opened 2 days ago by thomas-yanxin

Three failed evaluations

6
#892 opened 4 days ago by Pretergeek

Failed requests

12
#888 opened 5 days ago by LiteAI-Team

model evaluation failed

7
#886 opened 8 days ago by MaziyarPanahi

Model fail, re-eval request 😊

3
#885 opened 8 days ago by dnhkng

Incorrect ifeval benchmark

2
#879 opened 13 days ago by DavidGF

Upvote to evaluate deepseek-coder-v2

2
#793 opened about 2 months ago by g1y5x3