FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! โข 44 items โข Updated Oct 17, 2024 โข 65
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation โข Updated 8 days ago โข 826k โข โข 1.09k
Emu3 Collection Emu3: Next-Token Prediction is All You Need โข 7 items โข Updated 4 days ago โข 68