SmolLM3-3B β€” Q4_K_M for IBM Power (Linux ppc64le + AIX)

SmolLM3-3B quantized to Q4_K_M with a Q6_K output head for fast, accurate CPU inference on IBM Power β€” POWER9 (VSX) and POWER10/11 (MMA-accelerated) via LibrePower. No GPU required. Size: 1.8G.

Run it

Ubuntu / Debian ppc64le:

curl -fsSL https://linux.librepower.org/install.sh | sudo sh
sudo apt install librepower-llama
wget https://huggingface.co/librepowerai/SmolLM3-3B-Power/resolve/main/SmolLM3-3B-Q4_K_M.gguf
lp-llama-completion -m SmolLM3-3B-Q4_K_M.gguf -p "Hello!" -n 64 -t $(nproc)

IBM AIX 7.3 (big-endian):

dnf install llama-aix
wget https://huggingface.co/librepowerai/SmolLM3-3B-Power/resolve/main/SmolLM3-3B-Q4_K_M-be.gguf
lp-llama-completion -m SmolLM3-3B-Q4_K_M-be.gguf -p "Hello!" -n 64 -t $(nproc)

Files

  • SmolLM3-3B-Q4_K_M.gguf β€” little-endian (Ubuntu/Linux ppc64le)
  • SmolLM3-3B-Q4_K_M-be.gguf β€” big-endian (IBM AIX)

Good for

Fully-open general assistant, multilingual, reasoning

Credits

Base model by its original authors (Apache-2.0). Quantization & Power packaging: LibrePower.

Downloads last month
61
GGUF
Model size
3B params
Architecture
smollm3
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for librepowerai/SmolLM3-3B-Power

Quantized
(92)
this model