ModelCloud/Falcon3-10B-Instruct-gptqmodel-4bit-vortex-v1 Text Generation • Updated 5 days ago • 35 • 3
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v1 Text Generation • Updated 8 days ago • 1.16k • 51
Vortex Collection ModelCloud optimized and validated quants that pass/meet strict quality assurance on multiple benchmarks. • 9 items • Updated about 9 hours ago • 7
ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v1 Text Generation • Updated 8 days ago • 1.16k • 51
view article Article Accelerating LLM Inference: Fast Sampling with Gumbel-Max Trick By cxdu • Oct 24 • 10
ModelCloud/Qwen2.5-Coder-32B-Instruct-gptqmodel-4bit-vortex-v1 Text Generation • Updated Nov 14 • 712 • 12
ModelCloud/Qwen2.5-Coder-32B-Instruct-gptqmodel-4bit-vortex-v1 Text Generation • Updated Nov 14 • 712 • 12
ModelCloud/Llama-3.2-1B-Instruct-gptqmodel-4bit-vortex-v2.5 Text Generation • Updated Nov 11 • 1.89k • 2
ModelCloud/Llama-3.2-3B-Instruct-gptqmodel-4bit-vortex-v3 Text Generation • Updated Nov 11 • 1.62k • 4