Instructions to use appautomaton/gemma-3-12b-it-backbone-4bit-mlx with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use appautomaton/gemma-3-12b-it-backbone-4bit-mlx with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir gemma-3-12b-it-backbone-4bit-mlx appautomaton/gemma-3-12b-it-backbone-4bit-mlx
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
Gemma 3 12B IT Text-Encoder Backbone (MLX, 4-bit)
MLX 4-bit conversion of the Gemma 3 12B IT text backbone. It serves as the text-conditioning encoder for DramaBox TTS in mlx-speech, exposing the per-layer hidden states the audio model conditions on. It is not a chat or text-generation model.
Backbone, not a full Gemma. The language-model head and generation path are removed. It emits hidden states only. Use a standard Gemma checkpoint for chat or text generation.
Model Details
- Developed by: App Automaton
- Upstream model:
google/gemma-3-12b-it, converted from thegemma-3-12b-it-qat-q4_0release - Role: text-encoder backbone. Produces all 49 hidden states consumed by the DramaBox prompt pipeline.
- Quantization: MLX 4-bit affine,
group_size=64, with bf16 scales and norms - Runtime: MLX on Apple Silicon
Contents
| File | Component | Format | Size |
|---|---|---|---|
model-0000{1,2}-of-00002.safetensors |
Gemma 3 12B text backbone | 4-bit affine | ~6.2 GB |
model.safetensors.index.json |
Shard index | JSON | n/a |
config.json |
Backbone and quantization config | JSON | n/a |
tokenizer.json, tokenizer.model, tokenizer_config.json |
Gemma tokenizer | JSON | n/a |
How to Get Started
This backbone is the text encoder for DramaBox. Download it alongside the DramaBox weights:
hf download appautomaton/gemma-3-12b-it-backbone-4bit-mlx \
--local-dir models/gemma_3_12b_it_backbone/mlx-4bit
It is then passed to DramaBoxModel.from_dir(..., gemma_dir=...). See the DramaBox card for the full pipeline. To load the backbone directly:
from mlx_speech.models.gemma3_text import load_gemma3_text_model, LTXVGemmaTokenizer
gemma, _ = load_gemma3_text_model("models/gemma_3_12b_it_backbone/mlx-4bit")
tokenizer = LTXVGemmaTokenizer.from_dir("models/gemma_3_12b_it_backbone/mlx-4bit")
Intended Use
Text conditioning for LTX-2-derived audio diffusion models, specifically DramaBox TTS. It is reusable as a Gemma 3 12B feature extractor on Apple Silicon, but it carries no generation head.
Links
- Source code:
appautomaton/mlx-speech - Paired model:
appautomaton/dramabox-tts-3.3b-bf16-mlx - Upstream model:
google/gemma-3-12b-it - More MLX speech models: App Automaton on Hugging Face
License
Gemma. Use is governed by Google's Gemma Terms of Use. By downloading or using these weights you agree to those terms.
- Downloads last month
- 174
4-bit