Apostate Edited Model

Base model: google/gemma-4-E4B-it

Metrics

Metric Value
Baseline refusal 95.8%
Edited refusal 3.8%
Refusal metric classifier + weak guard
Harmless KL 0.149
KL target 0.300
Preserve rank 8
Preserve source harmless
Direction layer 25
Elapsed 1330.6 sec

Measurement

field value
edit type weight projection
refusal judge classifier + weak guard
preservation metric harmless kl
Downloads last month
22
Safetensors
Model size
9B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for Spaceballs/gemma-4-E4B-it-apostate

Finetuned
(209)
this model
Quantizations
1 model

Collection including Spaceballs/gemma-4-E4B-it-apostate