mlx-community/QVQ-72B-Preview-8bit

This model was converted to MLX format from Qwen/QVQ-72B-Preview using mlx-vlm version 0.1.6. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm

python -m mlx_vlm.generate --model mlx-community/QVQ-72B-Preview-8bit --max-tokens 100 --temp 0.0
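The command above can also be assembled and launched from a script. The following is a minimal sketch, not part of the original model card: the helper names `build_generate_command` and `run_generate` are hypothetical, and the optional `--prompt` and `--image` flags are assumptions about the `mlx_vlm.generate` CLI that the card itself does not show. Only `--model`, `--max-tokens`, and `--temp` come from the command above.

```python
import shlex
import subprocess
import sys

MODEL_ID = "mlx-community/QVQ-72B-Preview-8bit"


def build_generate_command(model=MODEL_ID, max_tokens=100, temp=0.0,
                           prompt=None, image=None):
    """Build the argv list for `python -m mlx_vlm.generate` (hypothetical helper)."""
    cmd = [
        sys.executable, "-m", "mlx_vlm.generate",
        "--model", model,
        "--max-tokens", str(max_tokens),
        "--temp", str(temp),
    ]
    # ASSUMPTION: --prompt and --image are not shown in the model card;
    # check `python -m mlx_vlm.generate --help` for your installed version.
    if prompt is not None:
        cmd += ["--prompt", prompt]
    if image is not None:
        cmd += ["--image", image]
    return cmd


def run_generate(**kwargs):
    """Run the command and return its stdout (hypothetical helper).

    Requires mlx-vlm to be installed; the model weights are fetched on first use.
    """
    result = subprocess.run(
        build_generate_command(**kwargs),
        capture_output=True, text=True, check=True,
    )
    return result.stdout


if __name__ == "__main__":
    # Only print the command here; call run_generate() to actually execute it.
    print(shlex.join(build_generate_command()))
```

Building the argument list separately from running it makes the command easy to inspect before launching a model of this size.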

Safetensors model size: 20.6B params

Tensor types: FP16, U32
