This model is a full-weight fine-tune of google/gemma-4-12B-it on
voidful/agent-sft.
It is not LoRA, QLoRA, or a merged adapter. The SFT run updated the language model weights and kept the vision/audio embedding stacks frozen so the multimodal interface remained loadable. Red-square image smoke tests passed for the selected checkpoint.
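The freezing scheme described above can be sketched as follows. This is a minimal illustration, not the training code used for this run: the name fragments `vision` and `audio` and the toy parameter names are assumptions, and the real module names in the Gemma checkpoint may differ. The helper accepts any iterable of `(name, parameter)` pairs, which is the shape `torch.nn.Module.named_parameters()` yields, and is demonstrated here on stand-in objects so it runs without PyTorch.

```python
from types import SimpleNamespace

# Hypothetical name fragments; the real Gemma module names may differ.
FROZEN_FRAGMENTS = ("vision", "audio")


def freeze_by_name(named_params, fragments=FROZEN_FRAGMENTS):
    """Disable gradients for parameters whose name contains any fragment.

    Returns the names of the parameters that were frozen.
    """
    frozen = []
    for name, param in named_params:
        if any(fragment in name for fragment in fragments):
            param.requires_grad = False
            frozen.append(name)
    return frozen


# Stand-in parameters (assumed names) that mimic the requires_grad attribute.
toy_params = {
    "language_model.layers.0.mlp.weight": SimpleNamespace(requires_grad=True),
    "vision_tower.patch_embed.weight": SimpleNamespace(requires_grad=True),
    "audio_tower.conv.weight": SimpleNamespace(requires_grad=True),
}

frozen_names = freeze_by_name(toy_params.items())
trainable_names = [n for n, p in toy_params.items() if p.requires_grad]

print("frozen:", frozen_names)
print("trainable:", trainable_names)
```

With a real model, the same helper could be called as `freeze_by_name(model.named_parameters())` before building the optimizer, so that only the language model weights receive updates.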
wave002-c400-checkpoint-400
/work/voidful2nlp/gemma-agent-sft/preserved_checkpoints_12b/wave002-c400-checkpoint-400
wave001-checkpoint-400
8192
5e-7
voidful/claw-eval-zh --language tw --suite all --core
google/gemma-4-31B-it

Best complete CLAW TW core score:
9.567 / 20 = 47.83%
Baseline google/gemma-4-12B-it score from the same evaluation setup:
8.106 / 20 = 40.53%
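The two scores can be compared with a few lines of arithmetic. The sketch below only uses the numbers reported above (9.567, 8.106, and the maximum of 20); the derived gains are computed here for convenience and are not figures reported by the evaluation itself.

```python
MAX_SCORE = 20.0
TUNED_SCORE = 9.567      # best complete CLAW TW core score
BASELINE_SCORE = 8.106   # google/gemma-4-12B-it, same evaluation setup


def as_percent(score, max_score=MAX_SCORE):
    """Convert a raw score to a percentage of the maximum score."""
    return 100.0 * score / max_score


tuned_pct = as_percent(TUNED_SCORE)
baseline_pct = as_percent(BASELINE_SCORE)

# Difference in percentage points between the two runs.
gain_points = tuned_pct - baseline_pct

# Improvement relative to the baseline score, in percent.
relative_gain = 100.0 * (TUNED_SCORE - BASELINE_SCORE) / BASELINE_SCORE

print(f"tuned:    {tuned_pct:.3f}%")
print(f"baseline: {baseline_pct:.3f}%")
print(f"gain:     {gain_points:.3f} percentage points")
print(f"relative: {relative_gain:.2f}% over baseline")
```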
See PLAYBOOK.md for the training, Slurm, checkpoint, smoke-test, and
evaluation exploration log. The selected checkpoint config is in
training_config.yml; selected evaluation artifacts are under eval_results/.