Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
inflaton-ai
/
logical-reasoning
like
0
Build error
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
c3b03b8
logical-reasoning
/
notebooks
Commit History
Update 00_Data Analysis.ipynb
c3b03b8
dh-mc
commited on
Oct 21, 2024
figures for CALM
fce143a
inflaton
commited on
Oct 21, 2024
use lfs
b9a8297
dh-mc
commited on
Oct 3, 2024
final results
c8eca2c
dh-mc
commited on
Sep 30, 2024
fix bug in perf calc
11f2c15
dh-mc
commited on
Sep 27, 2024
clean up
eb1782a
dh-mc
commited on
Sep 27, 2024
clean mistral
7534ff6
dh-mc
commited on
Sep 27, 2024
clean up data
0e5f859
dh-mc
commited on
Sep 27, 2024
finalized results: openai, qwen2.5 3/7b, internlm 7b/7b-1m
863a809
dh-mc
commited on
Sep 27, 2024
final few-shots
84958e4
dh-mc
commited on
Sep 26, 2024
try claude 5-shot
20bd1d6
inflaton
commited on
Sep 26, 2024
use LFS for notebooks
e2a2251
dh-mc
commited on
Sep 26, 2024
ready for final run
8157c36
dh-mc
commited on
Sep 26, 2024
o1-mini complete
474c1ca
inflaton
commited on
Sep 25, 2024
o1-mini 40-shot
9cd0757
inflaton
commited on
Sep 25, 2024
o1-mini 50-shot
f16c8b2
inflaton
commited on
Sep 25, 2024
ready for final run
687f68b
dh-mc
commited on
Sep 24, 2024
0.5b results
baca3cc
dh-mc
commited on
Sep 23, 2024
0.5/1.5/3b results
d94ae74
dh-mc
commited on
Sep 22, 2024
qwen2.5-1.5b
ceb9311
dh-mc
commited on
Sep 22, 2024
qwen2.5-3b 30-shot
d9bccd7
dh-mc
commited on
Sep 21, 2024
ready for tuning qwen2.5-72b
de489ee
dh-mc
commited on
Sep 21, 2024
ready for qwen2.5-7b
37fb5b2
dh-mc
commited on
Sep 21, 2024
ready for qwen2.5
d5ab5d2
dh-mc
commited on
Sep 21, 2024
updated scripts
959d8cc
dh-mc
commited on
Sep 21, 2024
o1-preview few-shots complete
6ff118c
inflaton
commited on
Sep 21, 2024
more o1 results
e7f34e0
inflaton
commited on
Sep 21, 2024
counted few-shot prompts for all models
a8683cf
dh-mc
commited on
Sep 20, 2024
o1-preview 20-shot
0baa6cc
inflaton
commited on
Sep 19, 2024
o1-preview 5-shot
f2a583b
inflaton
commited on
Sep 19, 2024
o1-mini 5/20 shots results
9042941
inflaton
commited on
Sep 19, 2024
try 5-shot for open source models
d2150e8
dh-mc
commited on
Sep 18, 2024
o1-preview 0-shot
545719f
inflaton
commited on
Sep 18, 2024
o1-mini 0-shot
16adfc9
inflaton
commited on
Sep 16, 2024
o1-preview 10-shot
6838eea
inflaton
commited on
Sep 16, 2024
ready to run 10-shots for 70/72B models
809e98c
dh-mc
commited on
Sep 16, 2024
10-shot results ready for 7/8 B models
3db2ae5
dh-mc
commited on
Sep 16, 2024
0-shot notebook
5b276b0
dh-mc
commited on
Sep 15, 2024
ready for few shots eval
cf912f1
dh-mc
commited on
Sep 14, 2024
claude 0-shot
397a2fa
inflaton
commited on
Sep 14, 2024
added original data from MGTV challenge
5f9686b
dh-mc
commited on
Sep 14, 2024
compare o1 vs gpt-4o
4cd13da
dh-mc
commited on
Sep 14, 2024
o1-mini analyzed
f1b0a53
dh-mc
commited on
Sep 13, 2024
o1-mini results
fd14581
inflaton
commited on
Sep 13, 2024
openai batch
921fa92
dh-mc
commited on
Sep 13, 2024
Create 04e_OpenAI_comparison.ipynb
2bb5512
dh-mc
commited on
Sep 13, 2024
Update 04_Few-shot_Prompting_OpenAI.ipynb
8e678e8
dh-mc
commited on
Sep 12, 2024
saved best results/metrics
573f5d1
dh-mc
commited on
Sep 12, 2024
completed eval/analysis
468b88d
dh-mc
commited on
Sep 12, 2024
openai zero-shot results
8b9bb19
inflaton
commited on
Sep 10, 2024
Previous
1
2
Next