logical-reasoning / data / openai_metrics.csv

Commit History

counted few-shot prompts for all models
a8683cf

dh-mc committed on

try 5-shot for open source models
d2150e8

dh-mc committed on

o1-mini analyzed
f1b0a53

dh-mc committed on

openai batch
921fa92

dh-mc committed on

completed eval/analysis
468b88d

dh-mc committed on