How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="strykes/SteraFunctionGemma-270M",
	filename="",
)
llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)

SteraFunctionGemma-270M

A full fine-tune of google/functiongemma-270m-it (Gemma 3, 270M) on the ~30k-example Tiny-Giant agentic tool-use / debugging dataset.

An ultra-small (270M) agentic coder. The Q4_K_M GGUF is tiny (~200 MB) and runs comfortably CPU-only (laptops, small VPS), while speaking the deterministic Hermes/ChatML <tool_call> format used by the Tiny-Giant harness.

Files

File Description
SteraFunctionGemma-270M-Q4_K_M.gguf Q4_K_M quant (~200 MB) — llama.cpp / Ollama / LM Studio, CPU-friendly
SteraFunctionGemma-270M-f16.gguf f16 GGUF — re-quantize to any level without retraining
raw_weights/ Full bf16 safetensors HF checkpoint
val_meta.jsonl Held-out validation set shipped with the model

Training

  • Base: google/functiongemma-270m-it (Gemma 3, 270M, gated/Apache-style Gemma license)
  • Method: full fine-tune (not LoRA), bf16 + gradient checkpointing
  • Data: ~30k Tiny-Giant agentic tool-use / debugging conversations
  • Epochs: 2 · LR: 1e-5 (cosine, 3% warmup) · Seq len: 4096

Prompt format

Trained with an explicit ChatML / Hermes renderer (not Gemma's native <start_of_turn> template). Pin ChatML when serving (--chat-template chatml). Tool calls:

<tool_call>
{"name": "<function-name>", "arguments": {...}}
</tool_call>

Inference (llama.cpp, CPU-friendly)

llama-cli -m SteraFunctionGemma-270M-Q4_K_M.gguf --chat-template chatml

License

Inherits the Gemma license from the google/functiongemma-270m-it base model.

Downloads last month
18
GGUF
Model size
0.3B params
Architecture
gemma3
Hardware compatibility
Log In to add your hardware

4-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for strykes/SteraFunctionGemma-270M

Quantized
(49)
this model