Curated collection of high-performance quantized LLMs optimized for efficient inference, lower VRAM usage, and production deployment.