mlx-community
/

gemma-4-12B-it-assistant-nvfp4

gemma4_unified_assistant

4-bit precision

Model card Files Files and versions

mlx-community/gemma-4-12B-it-assistant-nvfp4

This model was converted to MLX format from google/gemma-4-12B-it-assistant using mlx-vlm version 0.6.0. Refer to the original model card for more details on the model.

Use with mlx

pip install -U mlx-vlm

python -m mlx_vlm.generate --model mlx-community/gemma-4-12B-it-assistant-nvfp4 --max-tokens 100 --temperature 0.0 --prompt "Describe this image." --image <path_to_image>

Downloads last month: 52

Safetensors

Model size

0.1B params

Tensor type

U8

·

U32

·

BF16

·

MLX

Hardware compatibility

Log In to add your hardware

4-bit

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for mlx-community/gemma-4-12B-it-assistant-nvfp4

Base model

google/gemma-4-12B-it-assistant

Quantized

(11)

this model

Collection including mlx-community/gemma-4-12B-it-assistant-nvfp4

Gemma 4

88 items • Updated 1 day ago • 133