OpenBuddy/openbuddy-qwen2.5coder-32b-v24.1q-200k-gguf

⚛️ Q Model: Optimized for Enhanced Quantized Inference Capability

This model has been specifically optimized to perform better under quantized inference and is recommended for use at 3-bit to 8-bit quantization.

Downloads last month: 351

GGUF

Model size: 32.8B params
Architecture: qwen2
Available quantizations: 3-bit, 8-bit
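
To get a feel for what the 3-bit to 8-bit range means for a model of this size, the following is a rough back-of-the-envelope sketch, not something taken from the model card. It assumes a uniform average bit width across all 32.8B parameters and counts the weights only; real GGUF files mix bit widths per tensor and carry metadata, and the KV cache for long contexts needs additional memory on top. The function name `approx_weight_gib` is hypothetical.

```python
# Rough estimate of weight storage at a given average bit width.
# Assumption: every parameter is stored at the same bit width, which is
# a simplification; actual GGUF quantization types mix widths and add
# per-block scale data.

PARAMS = 32.8e9  # parameter count as listed on the model card


def approx_weight_gib(bits_per_weight: float, params: float = PARAMS) -> float:
    """Return the approximate size of the weights alone, in GiB."""
    total_bytes = params * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    for bits in (3, 4, 5, 6, 8):
        print(f"{bits}-bit: ~{approx_weight_gib(bits):.1f} GiB")
```

Under these assumptions the estimate scales linearly with the bit width, so the 8-bit figure is 8/3 of the 3-bit one. Treat the numbers as a lower bound when choosing a file for a given amount of RAM or VRAM.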

Model tree for OpenBuddy/openbuddy-qwen2.5coder-32b-v24.1q-200k-gguf

Base model: Qwen/Qwen2.5-32B
Finetuned: Qwen/Qwen2.5-Coder-32B
Finetuned: Qwen/Qwen2.5-Coder-32B-Instruct
Finetuned: OpenBuddy/openbuddy-qwen2.5coder-32b-v24.1q-200k
Quantized (2): this model