Gemma 4 E2B Humanize-RL β€” SFT LoRA adapter

LoRA adapter trained on the humanize-rl SFT dataset (jayshah5696/humanize-rl-sft-dataset) over unsloth/gemma-4-E2B-it. Designed to shift the base model's prose toward the humanize-rl rubric: natural, concise, low corporate filler, format-faithful.

For inference, prefer the pre-merged checkpoint jayshah5696/gemma4-e2b-humanize-unsloth-merged (parity-verified) unless you specifically need the adapter on top of a different base.

Quickstart β€” direct LoRA loading

import torch
from peft import PeftModel
from transformers import AutoModelForImageTextToText, AutoProcessor

base = AutoModelForImageTextToText.from_pretrained(
    "unsloth/gemma-4-E2B-it", torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(base, "jayshah5696/gemma4-e2b-humanize-unsloth-lora")
processor = AutoProcessor.from_pretrained("unsloth/gemma-4-E2B-it")

Provenance

  • base model: unsloth/gemma-4-E2B-it
  • training framework: Unsloth FastModel + TRL SFTTrainer
  • LoRA rank: 8
  • LoRA alpha: 8
  • bf16, no QLoRA
  • target modules: regex covering language-model {q,k,v,o,gate,up,down}_proj (PEFT skips Gemma 4 KV-shared layers whose k_proj/v_proj modules do not exist; this is correct).
  • license: Apache-2.0 (matches base)

Verification report

Known limitations

  • Gemma 4 KV-shared layers (indices 15-34) do not have k_proj/v_proj modules to attach LoRA to. PEFT silently skips them. This is correct and documented in transformers PR #45328.
  • Loaders other than Unsloth + PEFT may not understand this adapter's regex target_modules. If so, expand to a list before publishing derivatives.
Downloads last month
58
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for jayshah5696/gemma4-e2b-humanize-unsloth-lora

Adapter
(33)
this model