logical-reasoning / notebooks

Commit History

Update 00_Data Analysis.ipynb
c3b03b8

dh-mc commited on

figures for CALM
fce143a

inflaton commited on

use lfs
b9a8297

dh-mc commited on

final results
c8eca2c

dh-mc commited on

fix bug in perf calc
11f2c15

dh-mc commited on

clean up
eb1782a

dh-mc commited on

clean mistral
7534ff6

dh-mc commited on

clean up data
0e5f859

dh-mc commited on

finalized results: openai, qwen2.5 3/7b, internlm 7b/7b-1m
863a809

dh-mc commited on

final few-shots
84958e4

dh-mc commited on

try claude 5-shot
20bd1d6

inflaton commited on

use LFS for notebooks
e2a2251

dh-mc commited on

ready for final run
8157c36

dh-mc commited on

o1-mini complete
474c1ca

inflaton commited on

o1-mini 40-shot
9cd0757

inflaton commited on

o1-mini 50-shot
f16c8b2

inflaton commited on

ready for final run
687f68b

dh-mc commited on

0.5b results
baca3cc

dh-mc commited on

0.5/1.5/3b results
d94ae74

dh-mc commited on

qwen2.5-1.5b
ceb9311

dh-mc commited on

qwen2.5-3b 30-shot
d9bccd7

dh-mc commited on

ready for tuning qwen2.5-72b
de489ee

dh-mc commited on

ready for qwen2.5-7b
37fb5b2

dh-mc commited on

ready for qwen2.5
d5ab5d2

dh-mc commited on

updated scripts
959d8cc

dh-mc commited on

o1-preview few-shots complete
6ff118c

inflaton commited on

more o1 results
e7f34e0

inflaton commited on

counted few-shot prompts for all models
a8683cf

dh-mc commited on

o1-preview 20-shot
0baa6cc

inflaton commited on

o1-preview 5-shot
f2a583b

inflaton commited on

o1-mini 5/20 shots results
9042941

inflaton commited on

try 5-shot for open source models
d2150e8

dh-mc commited on

o1-preview 0-shot
545719f

inflaton commited on

o1-mini 0-shot
16adfc9

inflaton commited on

o1-preview 10-shot
6838eea

inflaton commited on

ready to run 10-shots for 70/72B models
809e98c

dh-mc commited on

10-shot results ready for 7/8 B models
3db2ae5

dh-mc commited on

0-shot notebook
5b276b0

dh-mc commited on

ready for few shots eval
cf912f1

dh-mc commited on

claude 0-shot
397a2fa

inflaton commited on

added original data from MGTV challenge
5f9686b

dh-mc commited on

compare o1 vs gpt-4o
4cd13da

dh-mc commited on

o1-mini analyzed
f1b0a53

dh-mc commited on

o1-mini results
fd14581

inflaton commited on

openai batch
921fa92

dh-mc commited on

Create 04e_OpenAI_comparison.ipynb
2bb5512

dh-mc commited on

Update 04_Few-shot_Prompting_OpenAI.ipynb
8e678e8

dh-mc commited on

saved best results/metrics
573f5d1

dh-mc commited on

completed eval/analysis
468b88d

dh-mc commited on

openai zero-shot results
8b9bb19

inflaton commited on