Qwen3-4B CoT Compression Study - a ssurface Collection

updated 5 days ago

LoRA adapters trained for 5 progressively shorter chain-of-thought styles on GSM8K, plus the eval artifacts behind the Pareto curve.