Swarm Sovereign bee mascot

Swarm Sovereign 12B GGUF

Swarm Sovereign 12B is a Gemma 4 12B IT based chat model prepared for local inference and LM Studio/llama.cpp usage.

This release contains both a Q4_K_M GGUF build and the merged Hugging Face safetensors build.

GGUF build:

  • File: swarm-sovereign-12b-Q4_K_M.gguf
  • Base model: google/gemma-4-12B-it
  • Architecture: Gemma 4
  • Quantization: Q4_K_M
  • Approximate file size: 6.9 GB
  • LM Studio estimated memory at 4096 context / 100% GPU offload: 7.71 GiB
  • Identity: Swarm Sovereign

Hugging Face / Transformers build:

  • Files: model-00001-of-00005.safetensors through model-00005-of-00005.safetensors, plus tokenizer/config files
  • Approximate directory size: 22 GB
  • Use this option for Transformers/PEFT-style local loading or downstream conversion workflows.

Local usage

llama.cpp

llama-cli \
  -m swarm-sovereign-12b-Q4_K_M.gguf \
  -ngl 99 \
  -c 4096 \
  --jinja \
  -p "What is your name? Answer in one sentence."

LM Studio

Import or download the GGUF in LM Studio, then load it as a local model. The local test identifier used during release validation was:

swarm-sovereign-12b

Validation

Local validation on Apple Silicon / LM Studio / llama.cpp:

Prompt: What is your name? Answer in one sentence.
Answer: My name is Swarm Sovereign.

LM Studio memory estimate at 4096 context with full GPU offload:

Estimated GPU Memory:   7.71 GiB
Estimated Total Memory: 7.71 GiB

Want to build and manage an entire private swarm of agents?

Want to build and manage an entire private swarm of agents all with shared memory, skills, and single setup?

Check out: https://hivemindos.liamvisionary.com X: @TheHivemindOS

Notes

This model is intended for local experimentation and private agent workflows. Test thoroughly for your use case before deploying in production.

Downloads last month
42
Safetensors
Model size
12B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for LiamVisionary/swarm-sovereign-12b

Quantized
(80)
this model