gemma4-e2b-scn-ollama-q4km

Direct Ollama distribution of the SCN Gemma4 artifact.

What this is

  • monolithic GGUF for Ollama local import
  • image + text capable
  • audio removed by design
  • derived from the official Ollama Gemma4 monolithic blob, with audio tensors and audio metadata pruned
  • packaged so that hf download followed by ollama create -f Modelfile is all that is needed
  • Related Hugging Face checkpoint repo: st4system/gemma4-e2b-scn

Why this repo exists

The older split GGUF + mmproj path is appropriate for llama.cpp, but not for direct Ollama Gemma4 vision use. This repo contains the single-GGUF artifact required by Ollama users.

Measured effect

  • source Ollama GGUF blob size before pruning: 6.7G
  • GGUF file size after pruning: 6.1G
  • removed audio metadata fields: 6
  • removed audio tensors: 752
  • kept tensors after pruning: 1,260
  • general.parameter_count after pruning: 4,815,994,851
  • size delta after pruning: 0.6G saved (8.58%)
  • quantization: q4_k_m
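
The size delta in the list above can be reproduced with a small helper. This is a minimal sketch, not the script used by this project; pct_saved is a hypothetical name. Fed the rounded sizes from the list, it gives a slightly different figure than the reported 8.58%, which was presumably computed from exact byte counts (an assumption).

```shell
# Percentage saved when a file shrinks from "before" to "after".
# Both arguments must use the same unit; exact byte counts give the
# most accurate result.
pct_saved() {
  awk -v before="$1" -v after="$2" \
    'BEGIN { printf "%.2f\n", (before - after) / before * 100 }'
}

# With the rounded sizes from the list above (6.7G and 6.1G):
pct_saved 6.7 6.1   # prints 8.96
```

To get closer to the reported value, pass byte counts instead, for example from wc -c < file on the source blob and the pruned GGUF.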

Validated

  • ollama show reports vision
  • Ollama text runtime passed
  • Ollama image runtime passed on the sample image in this project
  • audio is unsupported by design
  • PNG input was validated for image runtime

Install for Ollama users

HF_HUB_DISABLE_XET=1 hf download st4system/gemma4-e2b-scn-ollama-q4km --repo-type model --local-dir ~/models/gemma4-e2b-scn-ollama-q4km
cd ~/models/gemma4-e2b-scn-ollama-q4km
ollama create gemma4-scn -f Modelfile
ollama run gemma4-scn "こんにちは"

Vision smoke test:

ollama show gemma4-scn

If Capabilities includes vision, the model is packaged correctly.
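
The capability check can also be scripted. The sketch below is a simplification: has_vision is a hypothetical helper that only looks for the word "vision" anywhere in the text it receives, rather than parsing the Capabilities section specifically.

```shell
# Succeeds if the text on stdin (for example, `ollama show` output)
# contains the word "vision". Simplification: it does not check that
# the word appears under the Capabilities heading.
has_vision() {
  grep -qiw 'vision'
}

# Example usage (requires Ollama and the model created as above):
# if ollama show gemma4-scn | has_vision; then
#   echo "vision OK"
# else
#   echo "vision capability missing" >&2
# fi
```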

For actual image inference, use PNG or JPEG inputs in Ollama clients (PNG is the format validated above).
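
One way to send an image is through Ollama's HTTP API, which accepts base64-encoded images in the images field of a /api/generate request. The sketch below only builds the request body; build_image_request and sample.png are hypothetical names, and the helper does no JSON escaping, so the prompt must not contain double quotes or backslashes.

```shell
# Build a JSON request body for Ollama's /api/generate endpoint.
# Arguments: model name, prompt text, path to a PNG or JPEG file.
# No JSON escaping is done: keep the prompt free of " and \ characters.
build_image_request() {
  model="$1"; prompt="$2"; image="$3"
  # Encode the image and strip the line breaks that base64 may insert.
  b64=$(base64 < "$image" | tr -d '\n')
  printf '{"model":"%s","prompt":"%s","images":["%s"],"stream":false}\n' \
    "$model" "$prompt" "$b64"
}

# Example usage (assumes Ollama is listening on its default port 11434
# and sample.png is a local image):
# build_image_request gemma4-scn "Describe this image." sample.png \
#   | curl -s http://localhost:11434/api/generate -d @-
```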

Files

  • gemma4-e2b-scn-q4_k_m.gguf
  • Modelfile
  • ollama_manifest.json
  • SHA256SUMS
  • NOTICE.txt
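
Since the repo ships SHA256SUMS, the download can be checked before importing it into Ollama. This is a minimal sketch assuming SHA256SUMS uses the standard sha256sum format (hash, two spaces, file name); verify_download is a hypothetical helper name.

```shell
# Verify every file listed in SHA256SUMS inside the given directory
# (defaults to the current directory). Runs in a subshell so the
# caller's working directory is left unchanged.
verify_download() (
  cd "${1:-.}" && sha256sum -c SHA256SUMS
)

# Example usage: only create the Ollama model if the checksums match.
# cd ~/models/gemma4-e2b-scn-ollama-q4km
# verify_download . && ollama create gemma4-scn -f Modelfile
```

On systems without sha256sum (for example, stock macOS), shasum -a 256 -c SHA256SUMS is the usual equivalent.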

Compliance notes

  • This is a derivative of google/gemma-4-e2b-it
  • This is not an official Google release
  • No OpenAI / ChatGPT-generated training data was used in this project