liodon-ai/slm-10m · Announcing SLM-Bench: Evaluation Benchmark for SLM-10M

Announcing SLM-Bench: Evaluation Benchmark for SLM-10M

by PY-AI-Dev - opened 3 days ago

Liodon AI org 3 days ago

We've released SLM-Bench, a benchmark specifically designed for evaluating sub-10M models like SLM-10M.

6 categories, 500 questions each (3,000 total):

Zero manual annotation. The entire benchmark is generated by code, making it fully reproducible and transparent.

lm_eval --model hf \
  --model_args pretrained=liodon-ai/slm-10m,trust_remote_code=True \
  --tasks slm_bench \
  --device cuda:0 --batch_size 64

Apache-2.0 license.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment