Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper โข 2401.10774 โข Published Jan 19 โข 50