Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
inflaton-ai
/
logical-reasoning
like
0
Build error
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
6838eea
logical-reasoning
Commit History
o1-preview 10-shot
6838eea
inflaton
commited on
Sep 16, 2024
ready to run 10-shots for 70/72B models
809e98c
dh-mc
commited on
Sep 16, 2024
10-shot results ready for 7/8 B models
3db2ae5
dh-mc
commited on
Sep 16, 2024
logs/internlm2_5-20b-chat_tune_and_few_shots.txt
d8cfffb
inflaton
commited on
Sep 16, 2024
10-shot results
6bc1181
inflaton
commited on
Sep 15, 2024
internlm 20b results
47d6ce1
inflaton
commited on
Sep 15, 2024
0-shot notebook
5b276b0
dh-mc
commited on
Sep 15, 2024
Create eval-mgtv-internlm-20b.sh
75c4663
dh-mc
commited on
Sep 15, 2024
mistral 10-shot
33cd694
dh-mc
commited on
Sep 15, 2024
rtx4090 0-shot
d028752
dh-mc
commited on
Sep 15, 2024
ready for few shots eval
cf912f1
dh-mc
commited on
Sep 14, 2024
claude 0-shot
397a2fa
inflaton
commited on
Sep 14, 2024
added original data from MGTV challenge
5f9686b
dh-mc
commited on
Sep 14, 2024
https://github.com/mazzzystar/TurtleBenchmark
444a581
dh-mc
commited on
Sep 14, 2024
compare o1 vs gpt-4o
4cd13da
dh-mc
commited on
Sep 14, 2024
o1-mini analyzed
f1b0a53
dh-mc
commited on
Sep 13, 2024
o1-mini results
fd14581
inflaton
commited on
Sep 13, 2024
LogiQA2.0 dataset
bf13772
dh-mc
commited on
Sep 13, 2024
openai batch
921fa92
dh-mc
commited on
Sep 13, 2024
Create 04e_OpenAI_comparison.ipynb
2bb5512
dh-mc
commited on
Sep 13, 2024
internlm_v2 results
83818dc
inflaton
commited on
Sep 13, 2024
internlm2_5-7b-chat fine-tune results
e4bce5e
inflaton
commited on
Sep 13, 2024
added scripts/eval-mgtv-internlm_v2.sh
71dcee7
inflaton
commited on
Sep 13, 2024
Update 04_Few-shot_Prompting_OpenAI.ipynb
8e678e8
dh-mc
commited on
Sep 12, 2024
ready for fine-tuning internlm2_5-20b-chat
62c2b84
dh-mc
commited on
Sep 12, 2024
saved best results/metrics
573f5d1
dh-mc
commited on
Sep 12, 2024
completed eval/analysis
468b88d
dh-mc
commited on
Sep 12, 2024
qwen2-72b full results
6e932d8
inflaton
commited on
Sep 11, 2024
openai zero-shot results
8b9bb19
inflaton
commited on
Sep 10, 2024
Update eval_logical_reasoning_all_epochs.py
090acf8
dh-mc
commited on
Sep 10, 2024
change BATCH_SIZE to 1 for qwen2-72b eval
4c31851
dh-mc
commited on
Sep 10, 2024
open source LLM results almost done
5a8f8d2
dh-mc
commited on
Sep 10, 2024
llama3.1-70b done
5dc41da
inflaton
commited on
Sep 10, 2024
mistral updated
a9f4f1f
dh-mc
commited on
Sep 10, 2024
llama-3.1-70b wip
60dc2c4
inflaton
commited on
Sep 10, 2024
llama-3.1-70b wip
717ab95
inflaton
commited on
Sep 10, 2024
mistral wip
9129c41
dh-mc
commited on
Sep 10, 2024
llama3.1-70b wip
e5b5f58
inflaton
commited on
Sep 10, 2024
Update llm_utils.py
71af822
dh-mc
commited on
Sep 9, 2024
mistral complete
1e26971
dh-mc
commited on
Sep 9, 2024
llama3.1-70b wip
ff0dc02
inflaton
commited on
Sep 9, 2024
mistral wip
2f6ccd3
dh-mc
commited on
Sep 9, 2024
llama-3.1-8b results
fa0492a
dh-mc
commited on
Sep 9, 2024
Update eval-mgtv-qwen2_72b.sh
0b58370
dh-mc
commited on
Sep 9, 2024
clean up
629867b
dh-mc
commited on
Sep 9, 2024
done fine-tuning
72043da
inflaton
commited on
Sep 9, 2024
ready for eval
473e849
dh-mc
commited on
Sep 9, 2024
qwen2 72b 80% results
62df289
inflaton
commited on
Sep 9, 2024
tuning mistral cn
bbea107
inflaton
commited on
Sep 8, 2024
ready for bf16 tuning
e656f92
dh-mc
commited on
Sep 8, 2024
Previous
1
2
3
...
6
Next