g4-ann

Full Transformers safetensors model based on google/gemma-4-E4B-it.

Notes

  • This is a Gemma-family model.
  • annihilate_merge_metadata.json records the base model used for this export.
  • The repo includes the chat template, tokenizer, generation config, processor config, and full model weights.

Quick Load

from transformers import AutoModelForCausalLM, AutoTokenizer

repo = "tjcrims0n/g4-ann"
tokenizer = AutoTokenizer.from_pretrained(repo)
model = AutoModelForCausalLM.from_pretrained(repo, device_map="auto")

messages = [{"role": "user", "content": "Hello."}]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt", add_generation_prompt=True).to(model.device)
outputs = model.generate(inputs, max_new_tokens=80)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Smoke Test Notes

A local smoke test returned coherent answers for greeting, arithmetic, and model-family identity prompts.

Downloads last month
22
Safetensors
Model size
8B params
Tensor type
F16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for tjcrims0n/g4-ann

Finetuned
(231)
this model