BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation
Paper • 2402.10631 • Published • 2
| PPL | arc_easy | arc_challenge | piqa | winogrande | hellaswag | mmlu | QA Avg |
|---|---|---|---|---|---|---|---|
| 914841.62 | 25.25 ± 0.89 | 20.14 ± 1.17 | 53.70 ± 1.16 | 48.70 ± 1.40 | 25.59 ± 0.44 | - | 34.68 |
Training method based on BitDistiller Paper
Base model
TinyLlama/TinyLlama_v1.1