Commit History

ADD: BoolQ, TurthfulQA (#5)
7e92c24

Cookize commited on

fix cmmlu prompt
488ef58

facat commited on

clean bbh
29eceda

facat commited on

fix math cornercase
32ae64f

facat commited on

output in dataset
84e1d00

facat commited on

update index
5ad9651

facat commited on

Merge remote-tracking branch 'origin/main'
277ec33

facat commited on

update index
1395a53

facat commited on

Update README.md
d865e4b

facat commited on

Update README.md
ac6dc93

facat commited on

Update README.md
79f8d04

facat commited on

update index
72ccff7

facat commited on

fix load
33af91b

facat commited on

upd
c1cde4c

facat commited on

update
72dba58

facat commited on

update drop
5ca9a91

facat commited on

fix logging
132574a

facat commited on

update
c6f1343

facat commited on

clean
0c75eca

facat commited on

fix async
360e3ac

facat commited on

utils as attr
08339c7

facat commited on

fixup! fix task
0f420dd

facat commited on

feat: async run
f21585c

facat commited on

update MATH
4e1024e

facat commited on

fix task
232b173

facat commited on

update
f2c1a54

facat commited on

!ref suite
3a8c0d0

facat commited on

refactor
9827786

facat commited on

fix math
6d6787f

facat commited on

update math
58f14b3

facat commited on

fix dataset in task
76eab85

facat commited on

add math
d13c0d8

facat commited on

FIX: extraction func of C-Eval; logging metrics (#3)
25e4875

facat Cookize commited on

update mt_bench
845a45a

facat commited on

update
33a6f85

facat commited on

fix mmlu
9199665

facat commited on

fix fewshot
075ef98

facat commited on

verbose mode
a034e31

facat commited on

add gsm8k
18cd4ae

facat commited on

add mmlu and cmmlu
be1543a

facat commited on

upd
044ed98

facat commited on

update
69b800b

facat commited on

refactor
4c7982b

facat commited on

fix name
c250b54

facat commited on

add suite
a6d7b1c

facat commited on

fix
e01a5f6

facat commited on

upd
8af54b8

facat commited on

initial commit
507319c

facat commited on