Status: staged experiment, built for distributed testing. Token parity against the monolithic export has not yet been verified (open issue), so outputs should not be treated as 1:1 with it.
Staged Core AI export for google/gemma-4-E2B, built for caix distributed inference testing on Apple silicon.
.aimodel bundle · google/gemma-4-E2B
caix cluster plan accepts the manifest for a 64 GB Studio plus 32 GB MacBook setup; hardware runtime smoke is still pending.

brew upgrade redhillsmediafl/caix/caix || brew install redhillsmediafl/caix/caix
caix catalog install redhillsmediafl/rhm-gemma-4-e2b-staged-caix
caix cluster plan \
--manifest ~/.caix/models/exports/gemma4-e2b-staged-4bit-ctx128-2x/stage-manifest.json \
--workers studio=64,macbook=32 \
--kv-capacity 128
Run workers with caix cluster join and the coordinator with caix serve --cluster using the same manifest.
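Because the plan, the workers, and the coordinator all read the same stage-manifest.json, it can help to confirm that the file is present and parses as JSON before running caix cluster plan. The sketch below is one way to do that. It assumes python3 is on PATH. check_manifest is a hypothetical helper, not a caix command, and it checks nothing about the manifest's actual schema.

```shell
# Hypothetical pre-flight helper (not part of caix).
# Succeeds only if the given manifest file exists and parses as JSON.
# Assumes python3 is available; it does not validate the manifest schema.
check_manifest() {
  manifest="$1"

  # The file must exist and be a regular file.
  if [ ! -f "$manifest" ]; then
    echo "manifest not found: $manifest" >&2
    return 1
  fi

  # json.tool exits non-zero when the input is not valid JSON.
  if ! python3 -m json.tool "$manifest" >/dev/null 2>&1; then
    echo "manifest is not valid JSON: $manifest" >&2
    return 1
  fi

  echo "manifest looks readable: $manifest"
}
```

One possible use is to gate the plan step on it, for example by running check_manifest on the stage-manifest.json path shown above and invoking caix cluster plan only if it succeeds. Passing this check says nothing about whether the plan itself will be accepted.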
More open-source work: redhillsmediafl.com/open-source.
Base model
google/gemma-4-E2B