Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
inflaton-ai
/
logical-reasoning
like
0
Build error
App
Files
Files
Community
1
Fetching metadata from the HF Docker repository...
a8683cf
logical-reasoning
/
data
6 contributors
History:
37 commits
dh-mc
counted few-shot prompts for all models
a8683cf
5 months ago
Llama3.1-70B-Chinese-Chat_metrics.csv
Safe
2.09 kB
completed eval/analysis
5 months ago
Llama3.1-70B-Chinese-Chat_results.csv
Safe
2.99 MB
ready to run 10-shots for 70/72B models
5 months ago
Llama3.1-70B-Chinese-Chat_shots_metrics.csv
Safe
246 Bytes
ready to run 10-shots for 70/72B models
5 months ago
Llama3.1-8B-Chinese-Chat_metrics.csv
Safe
1.92 kB
10-shot results ready for 7/8 B models
5 months ago
Llama3.1-8B-Chinese-Chat_results.csv
Safe
2.99 MB
10-shot results ready for 7/8 B models
5 months ago
Llama3.1-8B-Chinese-Chat_shots_metrics.csv
Safe
386 Bytes
10-shot results ready for 7/8 B models
5 months ago
Mistral-7B-v0.3-Chinese-Chat_metrics.csv
Safe
2.05 kB
10-shot results ready for 7/8 B models
5 months ago
Mistral-7B-v0.3-Chinese-Chat_results.csv
Safe
3.03 MB
10-shot results ready for 7/8 B models
5 months ago
Mistral-7B-v0.3-Chinese-Chat_shots_metrics.csv
Safe
430 Bytes
10-shot results ready for 7/8 B models
5 months ago
Qwen2-72B-Instruct_metrics.csv
Safe
1.8 kB
completed eval/analysis
5 months ago
Qwen2-72B-Instruct_results.csv
Safe
2.98 MB
ready to run 10-shots for 70/72B models
5 months ago
Qwen2-72B-Instruct_shots_metrics.csv
Safe
228 Bytes
ready to run 10-shots for 70/72B models
5 months ago
Qwen2-7B-Instruct_metrics.csv
Safe
1.69 kB
10-shot results ready for 7/8 B models
5 months ago
Qwen2-7B-Instruct_results.csv
Safe
3 MB
10-shot results ready for 7/8 B models
5 months ago
Qwen2-7B-Instruct_shots_metrics.csv
Safe
341 Bytes
10-shot results ready for 7/8 B models
5 months ago
all_model_token_counts.csv
Safe
5.54 kB
counted few-shot prompts for all models
5 months ago
anthropic_metrics.csv
Safe
190 Bytes
claude 0-shot
5 months ago
anthropic_results.csv
Safe
2.76 MB
claude 0-shot
5 months ago
best_metrics.csv
Safe
1.39 kB
try 5-shot for open source models
5 months ago
best_results.csv
Safe
2.97 MB
try 5-shot for open source models
5 months ago
gpt-4o-mini-10-shots_batch_results.jsonl
Safe
1.92 MB
openai batch
5 months ago
internlm2_5-20b-chat_metrics.csv
Safe
1.92 kB
10-shot results ready for 7/8 B models
5 months ago
internlm2_5-20b-chat_results.csv
Safe
3.56 MB
10-shot results ready for 7/8 B models
5 months ago
internlm2_5-20b-chat_shots_metrics.csv
Safe
204 Bytes
10-shot results ready for 7/8 B models
5 months ago
internlm2_5-7b-chat-1m_metrics.csv
Safe
1.89 kB
10-shot results ready for 7/8 B models
5 months ago
internlm2_5-7b-chat-1m_results.csv
Safe
2.99 MB
10-shot results ready for 7/8 B models
5 months ago
internlm2_5-7b-chat-1m_shots_metrics.csv
Safe
397 Bytes
10-shot results ready for 7/8 B models
5 months ago
internlm2_5-7b-chat_metrics.csv
Safe
1.78 kB
10-shot results ready for 7/8 B models
5 months ago
internlm2_5-7b-chat_results.csv
Safe
2.99 MB
10-shot results ready for 7/8 B models
5 months ago
internlm2_5-7b-chat_shots_metrics.csv
Safe
342 Bytes
10-shot results ready for 7/8 B models
5 months ago
openai_metrics.csv
Safe
2.52 kB
counted few-shot prompts for all models
5 months ago
openai_results.csv
Safe
3.19 MB
o1-preview 20-shot
5 months ago