VibeThinker-3B GGUF

This repository contains GGUF conversions and quantized GGUF files for WeiboAI/VibeThinker-3B.

The original model is licensed under the MIT License.

Files

File                          Description
VibeThinker-3B-Q4_K_M.gguf    Recommended quantization for general local use
VibeThinker-3B-Q5_K_M.gguf    Better quality than Q4_K_M, larger file
VibeThinker-3B-Q8_0.gguf      Higher quality than Q5_K_M, larger file
VibeThinker-3B-F16.gguf       Unquantized 16-bit (half-precision) GGUF, largest file, optional
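
To fetch a single file rather than the whole repository, something like the following sketch may help. It assumes the Hugging Face CLI (`huggingface-cli`, from the `huggingface_hub` package) is installed, and that the repository ID is Luncy1978/VibeThinker-3B-GGUF, as given in this page's model tree heading. The helper names `gguf_name` and `fetch_gguf` are hypothetical and exist only for illustration.

```shell
# Assumed repository ID, taken from this page's model tree heading.
REPO="Luncy1978/VibeThinker-3B-GGUF"

# Build the file name for a quantization label,
# e.g. Q4_K_M -> VibeThinker-3B-Q4_K_M.gguf
gguf_name() {
  printf 'VibeThinker-3B-%s.gguf\n' "$1"
}

# Download one file into the current directory (needs network access
# and huggingface-cli on PATH). Defined here but not called.
fetch_gguf() {
  huggingface-cli download "$REPO" "$(gguf_name "$1")" --local-dir .
}
```

Calling `fetch_gguf Q4_K_M` would then download only the recommended quantization; swap in `Q5_K_M`, `Q8_0`, or `F16` for the other files listed above.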

Usage with llama.cpp

llama-cli -m VibeThinker-3B-Q4_K_M.gguf -ngl 99 -c 8192 -cnv
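
In the command above, `-m` selects the model file, `-ngl 99` asks llama.cpp to offload up to 99 layers to the GPU (in practice, all layers of a model this size), `-c 8192` sets the context window to 8192 tokens, and `-cnv` starts interactive conversation mode. The sketch below wraps the same invocation so the values can be overridden from the environment. The variable names and the `run_chat` function are hypothetical, and it assumes `llama-cli` is on your PATH.

```shell
# Defaults mirror the command above; override by exporting the variable first.
MODEL="${MODEL:-VibeThinker-3B-Q4_K_M.gguf}"
NGL="${NGL:-99}"    # GPU layers to offload; use 0 for CPU-only inference
CTX="${CTX:-8192}"  # context window size in tokens

# Start an interactive chat session with the settings above.
# Defined here but not called, so sourcing this file has no side effects.
run_chat() {
  llama-cli -m "$MODEL" -ngl "$NGL" -c "$CTX" -cnv
}
```

For example, `NGL=0 CTX=4096 run_chat` would run on the CPU only with a smaller context, which may help on machines with limited memory.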

Format: GGUF
Model size: 3B params
Architecture: qwen2

Model tree for Luncy1978/VibeThinker-3B-GGUF

Base model: Qwen/Qwen2.5-3B
Quantized: this model