Muennighoff's picture
A
eb2a9c4
task,metric,value,err,version
anli_r1,acc,0.343,0.015019206922356951,0
anli_r2,acc,0.321,0.014770821817934649,0
anli_r3,acc,0.3375,0.013655897185463658,0
arc_challenge,acc,0.2738907849829352,0.013032004972989501,0
arc_challenge,acc_norm,0.302901023890785,0.013428241573185349,0
arc_easy,acc,0.6153198653198653,0.009983171707009011,0
arc_easy,acc_norm,0.5888047138047138,0.010096663811817681,0
boolq,acc,0.5840978593272171,0.00862046960400103,1
cb,acc,0.42857142857142855,0.06672848092813058,1
cb,f1,0.31094339622641504,,1
copa,acc,0.79,0.040936018074033256,0
hellaswag,acc,0.44612626966739694,0.004960732382255232,0
hellaswag,acc_norm,0.5825532762397929,0.004921300331285554,0
piqa,acc,0.7301414581066377,0.010356595421852199,0
piqa,acc_norm,0.7268770402611534,0.010395730264453262,0
rte,acc,0.5415162454873647,0.029992535385373314,0
sciq,acc,0.885,0.010093407594904617,0
sciq,acc_norm,0.843,0.011510146979230189,0
storycloze_2016,acc,0.7022982362373063,0.010573790208173062,0
winogrande,acc,0.5753749013417522,0.013891893150264227,0