open_ita_llm_leaderboard / leaderboard_general.csv
FinancialSupport's picture
Update leaderboard_general.csv
9c80b7c verified
raw
history blame
No virus
2.35 kB
model ,m_mmlu_it acc shot 3,m_mmlu_it acc shot 5,m_mmlu_it acc shot 0,belebele_ita_Latn acc,belebele_ita_Latn acc norm,helloswag_it acc,helloswag_it acc norm,lambada_openai_mt_it perplexity,lambada_openai_mt_it acc,xcopa_it acc,arc_it acc,arc_it acc norm
giux78/zefiro-7b-sft-qlora-ITA-v0.5,0.5196,0.5246,0.4762,0.4656,0.4656,0.4636,0.6097,22.5232,0.5154,0.67,0.1642,0.4397
mii-llm/maestrale-chat-v0.2-alpha,0.519,0.5163,0.4682,0.4678,0.4678,0.519,0.6852,26.0037,0.4987,0.722,0.1206,0.4585
FinancialSupport/saiga-7b,0.4973,0.4933,0.4982,0.5222,0.5222,0.4824,0.6342,30.2369,0.4671,0.672,0.16,0.4748
giux78/zefiro-7b-beta-ITA-v0.1,0.5297,0.5203,0.4716,0.45,0.45,0.4607,0.6129,25.8213,0.5013,0.666,0.0838,0.4294
raicritis/Hermes7b_ITA,,0.3574,0.3381,0.3689,0.3689,0.4112,0.5407,34.7106,0.4677,0.66,0.1249,0.3524
DeepMount/Mistral-Ita-7b,,0.3879,0.3538,0.38,0.38,0.3978,0.5123,89.99,0.3361,0.592,0,0.3747
galatolo/cerbero-7B,,0.5137,0.4867,0.5089,0.5089,0.4722,0.6135,23.4551,0.4964,0.672,0.1001,0.4465
mii-11m/maestrale-chat-v0.3-alpha,,0.5164,0.4774,0.5911,0.5911,0.5046,0.66,38.2427,0.4378,0.692,0.1343,0.4568
giux78/zefiro-7b-dpo-qlora-ITA-v0.7,0.508,0.5203,0.4717,0.4778,0.4778,0.4914,0.6428,23.6041,0.5174,0.684,0.1805,0.4611
mii-llm/maestrale-chat-v0.3-beta,,0.5129,,0.5644,0.5644,0.5067,0.6581,53.0646,0.4207,0.72,0.1463,0.4559
swap-uniba/LLaMAntino-2-7b-hf-ITA,,0.3696,,0.2433,0.2433,0.4113,0.5428,33.6146,0.4696,0.678,0.139,0.3456
mistralai/Mistral-7B-v0.1,,0.5253,,0.41,0.41,0.4486,0.6122,30.2635,0.4894,0.658,0.1061,0.4149
swap-uniba/LLaMAntino-2-70b-hf-UltraChat-ITA,,0.6084,,,,0.5085,0.6566,,,,0.2481,0.5004
MoxoffSpA/Azzurro,,0.5084,,,,0.5027,0.6074,,,,0.1497,0.4414
MoxoffSpA/Volare,,0.5509,,,,0.5027,0.6473,,,,0.1471,0.4713
DeepMount00/Mistral-Ita-7b,,0.5324,,,,0.5024,0.6728,,,,0.1574,0.5475
seeweb/SeewebLLM-it,,0.3487,,,,0.4062,0.5382,,,,0.0470,0.3507
swap-uniba/LLaMAntino-2-chat-13b-hf-UltraChat-ITA,,0.4472,,,,0.4486,0.5708,,,,0.1933,0.4046
giux78/llama3-8B-usenet-merged,,0.5485,,0.5511,0.5511,0.4634,0.6292,27.1629,0.5009,0.714,0.1027,0.4525
DeepMount00/Llama-3-8b-Ita,,0.572,,,,0.4998,0.6528,,,,0.2104,0.546
FairMind/Phi-3-mini-4k-instruct-bnb-4bit-Ita,,0.5389,,,,,0.5841,,,,,0.4414
FairMind/Llama-3-8B-4bit-UltraChat-Ita,,0.5328,,,,,0.6064,,,,,0.4611
DeepMount00/ITA_Foundation_LLM,,0.2952,,,,0.3395,0.4226,,,,0.0565,0.2857