view article Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset By sdiazlor • 22 days ago • 45
SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation Paper • 2410.03960 • Published Oct 4, 2024 • 2