โš›๏ธ Q Model: Optimized for Enhanced Quantized Inference Capability

This model has been specially optimized to improve the performance of quantized inference and is recommended for use in 3 to 8-bit quantization scenarios.

Downloads last month
351
GGUF
Model size
32.8B params
Architecture
qwen2

3-bit

8-bit

Inference API
Unable to determine this model's library. Check the docs .

Model tree for OpenBuddy/openbuddy-qwen2.5coder-32b-v24.1q-200k-gguf