Pretrained GPT2 with expanded n_ctx up to 2048(also with expanded embedding dimension to 1536) in Korean.

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 24.27
ARC (25-shot) 21.16
HellaSwag (10-shot) 28.11
MMLU (5-shot) 26.56
TruthfulQA (0-shot) 42.06
Winogrande (5-shot) 49.09
GSM8K (5-shot) 0.0
DROP (3-shot) 2.89
Downloads last month
1,467
Safetensors
Model size
392M params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Spaces using psyche/kogpt 25