Commit History

Update Qwen2.5-7B-Instruct_results.csv
45413ca

dh-mc commited on

7b 40-shot
b48fd62

inflaton commited on

Update Qwen2.5-0.5B-Instruct_results.csv
c8b677c

dh-mc commited on

0.5/1.5/3b results
d94ae74

dh-mc commited on

7b
a36def6

inflaton commited on

qwen2.5-1.5b
ceb9311

dh-mc commited on

qwen2.5-3b
e06d133

inflaton commited on

qwen2.5 3/7b fine-tuned
bd723f9

inflaton commited on

fix script
286c5d6

dh-mc commited on

Update Qwen2.5-3B-Instruct_results.csv
574ad95

dh-mc commited on

fix scripts
281682c

dh-mc commited on

qwen2.5-3b 30-shot
d9bccd7

dh-mc commited on

ready for tuning qwen2.5-72b
de489ee

dh-mc commited on

llama3.1-70b 30-shot
f06a1e9

inflaton commited on

ready for qwen2.5-7b
37fb5b2

dh-mc commited on

Update logical_reasoning_utils.py
0c7b7f6

dh-mc commited on

ready for qwen2.5
d5ab5d2

dh-mc commited on

qwen2.5-3b 0-shot
06dfa32

dh-mc commited on

updated scripts
959d8cc

dh-mc commited on

Qwen2.5 fine-tuned
f584ea4

inflaton commited on

tune qwen2.5
88d67f6

dh-mc commited on

o1-preview few-shots complete
6ff118c

inflaton commited on

llama3.1-70b 20-shot
96f3c1e

inflaton commited on

more o1 results
e7f34e0

inflaton commited on

more
489600e

inflaton commited on

o1-min 50-shot
3c4359a

inflaton commited on

mistral 5-shot
5158692

inflaton commited on

counted few-shot prompts for all models
a8683cf

dh-mc commited on

o1-preview 20-shot
0baa6cc

inflaton commited on

Update eval-mgtv-shots_4bit.sh
492d1d4

dh-mc commited on

log
fe51ea8

inflaton commited on

o1-preview 5-shot
f2a583b

inflaton commited on

o1-mini 5/20 shots results
9042941

inflaton commited on

try 5-shot for open source models
d2150e8

dh-mc commited on

o1-preview 0-shot
545719f

inflaton commited on

o1-mini 0-shot
16adfc9

inflaton commited on

o1-preview 10-shot
6838eea

inflaton commited on

ready to run 10-shots for 70/72B models
809e98c

dh-mc commited on

10-shot results ready for 7/8 B models
3db2ae5

dh-mc commited on

logs/internlm2_5-20b-chat_tune_and_few_shots.txt
d8cfffb

inflaton commited on

10-shot results
6bc1181

inflaton commited on

internlm 20b results
47d6ce1

inflaton commited on

0-shot notebook
5b276b0

dh-mc commited on

Create eval-mgtv-internlm-20b.sh
75c4663

dh-mc commited on

mistral 10-shot
33cd694

dh-mc commited on

rtx4090 0-shot
d028752

dh-mc commited on

ready for few shots eval
cf912f1

dh-mc commited on

claude 0-shot
397a2fa

inflaton commited on

added original data from MGTV challenge
5f9686b

dh-mc commited on

https://github.com/mazzzystar/TurtleBenchmark
444a581

dh-mc commited on