Spaces:
Restarting
on
CPU Upgrade
Restarting
on
CPU Upgrade
💬 Discussion thread: Model contamination techniques 💬
pinned
33
#472 opened 5 months ago
by
clefourrier
Future feature: system prompt and chat support
pinned
21
#459 opened 5 months ago
by
clefourrier
💬 Discussion thread: Model scores and model performances 💬
pinned
70
#265 opened 8 months ago
by
clefourrier
💎 Resources and community initiatives around the Leaderboard! 💎
pinned#174 opened 9 months ago
by
clefourrier
Detailed results are inconsistent
#734 opened about 11 hours ago
by
sbdzdz
porting-app-poc
2
#732 opened 1 day ago
by
alozowski
Understanding raw result data files
#729 opened 4 days ago
by
jerome-white
Mixtral 8x22B evaluation failed
1
#718 opened 9 days ago
by
a-normal-username
TRI-ML/mamba-7b-rw failed
7
#704 opened 14 days ago
by
devingulliver
GSM8K failure with Llama 3 finetunes
4
#703 opened 14 days ago
by
jeiku
GPTQ and Mixtral models will need to be relaunched
6
#692 opened 18 days ago
by
CombinHorizon
ALL Jamba models failing
13
#690 opened 19 days ago
by
devingulliver
72b models eval failed
12
#689 opened 19 days ago
by
paloalma
No good way to identify number of activated parameters causes MIxtral evaluation failures
21
#680 opened 22 days ago
by
0-hero
Crowd-Source Hardware for the LeaderBoard?
4
#570 opened 3 months ago
by
ibivibiv
Eval models for data contamination?
2
#561 opened 3 months ago
by
liyucheng
Feature request: Run 100B + models automatically
12
#434 opened 5 months ago
by
ChuckMcSneed
Feature Request for Leaderboard: date added to hub
2
#425 opened 5 months ago
by
madmaxbr5
Feature request: Using weights hash to identify duplicates
1
#422 opened 5 months ago
by
mrfakename
Feature request: Add non AutoModelForCausalLM models
3
#391 opened 6 months ago
by
KnutJaegersberg
Tool: Adding evaluation results to model cards
46
#370 opened 6 months ago
by
Weyaxi
Feature suggestion: average of selected (rather than all) columns
4
#368 opened 6 months ago
by
Minus0
Tool: Open LLM Leaderboard Model Renamer
31
#310 opened 7 months ago
by
Weyaxi
Checking for toxicity too
9
#53 opened 11 months ago
by
ronald-d-rogers