Gemma4-Gutenberg-31B-LoRA

The standalone LoRA adapter (r=64, text decoder only) behind nbeerbower/Gemma4-Gutenberg-31B — a Gutenberg-series ORPO finetune of google/gemma-4-31B-it for literary, novelistic prose.

~1.96 GB. Apply to the standard base, or to any architecturally-identical Gemma-4-31B variant — it transplants cleanly onto the abliterated heretic base (the adapter dominates the prose voice regardless of base; see Gemma4-Gutenberg-31B-Heretic).

Use

import torch
from transformers import AutoModelForCausalLM
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained("google/gemma-4-31B-it", dtype=torch.bfloat16)
model = PeftModel.from_pretrained(base, "nbeerbower/Gemma4-Gutenberg-31B-LoRA")
# or .merge_and_unload() for a standalone full model

Training

ORPO (β=0.1) on schneewolflabs/Athanorlite-DPO (14,816 pairs; the Gutenberg "Encore" mix + more). LR 5e-5 cosine, eff batch 32, max_len 2048, 1 epoch on 1× NVIDIA GB10, via Merlina. reward_accuracy 0.18 → 0.91. Full details on the merged model card.

License

Apache-2.0 (matching the Gemma 4 base).

Downloads last month
34
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for nbeerbower/Gemma4-Gutenberg-31B-LoRA

Adapter
(112)
this model
Merges
1 model
Quantizations
1 model

Dataset used to train nbeerbower/Gemma4-Gutenberg-31B-LoRA