GLM-5.2-Fable5-R2

LoRA fine-tune of GLM-5.2 (Test 3a ablated base) using Fable 5 distillation data.

Training Details

Parameter Value
Base model GLM-5.2 Test 3a (alignment-ablated)
Method LoRA (rank=64, alpha=128)
Target layers 60-77 (top 18 of 78)
Trainable params 98M (0.013% of 753B)
Learning rate 2e-5, cosine schedule
Steps 610 (1 epoch)
Training data 4,876 Fable 5 distillation examples
Hardware 8x H200 (141GB each)

Benchmark Results (v2 harness, accurate grading)

Benchmark Fable5-R2 Test 3a (base) Delta
GSM8K 99% 91% +8pp
HumanEval 84.2% 67.7% +16.5pp
HellaSwag 80.5% 78% +2.5pp
MMLU-Pro 84% 82% +2pp
GPQA 8% - -
SimpleQA 32% - -
AdvBench refusal 91% 1% +90pp (regression)
Borderline refusal 0% 0% -

Known Issues

  • High refusal rate (91% AdvBench): LoRA re-activated alignment circuits ablated in Test 3a
  • Visible chain-of-thought contamination in responses
  • Not for production use without additional alignment work

Usage

from transformers import AutoModelForCausalLM, AutoTokenizer
model = AutoModelForCausalLM.from_pretrained("cfontes/GLM-5.2-Fable5-R2", torch_dtype="auto", device_map="auto", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("cfontes/GLM-5.2-Fable5-R2", trust_remote_code=True)
Downloads last month
-
Safetensors
Model size
743B params
Tensor type
F32
·
BF16
·
F8_E4M3
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for cfontes/GLM-5.2-Fable5-R2

Base model

zai-org/GLM-5.2
Adapter
(1)
this model