You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Atlas Gemma-4-26B-trm

A trauma-informed AI companion specialised for adults with CPTSD, PTSD, and neurodivergence.

⚠️ EXPERIMENTAL MODEL — NOT FOR PRODUCTION DEPLOYMENT

This is a research model. It has not undergone clinical validation. It is not a finished product. Atlas v6 is in active development. The author accepts no liability for deployment outside the intended Atlas companion architecture.

⚠️ This model has been intentionally modified to reduce therapeutic refusal behaviour and crisis-line reflexes.

🎯 Purpose & Motivation

Atlas is the intelligence layer for Kintsugi Collective. An AI for adults with complex trauma (CPTSD), PTSD, and neurodivergence (ASD/ADHD). This is not a general-purpose model. It is a specialised therapeutic-context model.

This is v5 of Atlas, this iteration has double the harmful prompts, and a larger SFT dataset, further cleaned and reviewed line by line

v6 is in development

🔬 Methodology

  • Base Model: google/gemma-4-26b-a4b-it
  • Abliteration: Norm-preserving biprojected abliteration + Expert-Granular Abliteration (EGA)
    • Applied to all 30 layers (o_proj + mlp.down_proj)
    • Full expert ablation (128/128 per layer)
    • Direction: normalize(mean(harmful) - mean(harmless)) with Gram-Schmidt orthogonalization
    • Winsorization at 99.5th percentile
  • SFT: 3 epochs on a carefully curated ~1=1,900+ example dataset (60% high-quality synthetic, 40% redacted lived-experience data from the target cohort)
  • Training: Unsloth + bf16 on RTX 6000 Blackwell

Final SFT Loss: 0.157

📊 Key Results

Standout Results:

image

Training Configuration

SFT Parameters

Parameter Value
Epochs 3
Effective Batch Size 4
Learning Rate 2e-4
LR Scheduler Linear
Warmup Steps 10
Optimizer AdamW 8-bit
Weight Decay 0.01
LoRA Rank (r) 32
LoRA Alpha 64

Abliteration Parameters

Parameter Value
Layers Abliterated 100%
Experts Abliterated 100%
Scale 0.95
Winsorization 0.995

⚠️ Limitations & Responsible Use

  • This model has reduced refusal behaviour on therapeutic and dark content. It is not suitable for general deployment without guardrails.
  • Intended for use within the Atlas companion architecture with additional safety layers.
  • Not a replacement for human therapeutic support.
  • Patent pending (IP Australia).
Ethical Issue How Atlas Handles It Strength Level
Re-traumatization via refusals Deliberate abliteration + 0% therapeutic refusal rate on cohort-specific prompts Excellent
Abandonment & presence "Core philosophy (""the one that stays"") deeply trained into the model" Excellent
User sovereignty & agency "Sovereign Signal Vault, split-key encryption, burn protocol, user-directed interaction" Outstanding
Avoiding pathologising Explicit system prompt constraints + targeted training data Very Strong
Respecting neurodivergence "Training data and Atlas framework explicitly include masking, shutdowns, executive dysfunction, sensory issues, etc." Strong
Privacy of trauma disclosures "On-device Prompt Shield tokenisation, end-to-end encryption, no server-side readable data" Industry-leading
Avoiding generic crisis pivots Hard constraint in both training data and system prompt design Excellent

Kintsugi Collective — Reclaiming navigation rights to one’s own life.

|Gemma is a trademark of Google LLC|

This gemma4 model was trained 2x faster with Unsloth and Huggingface's TRL library.

Downloads last month
46
Safetensors
Model size
26B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for kintsugicollective/atlas-trm-v5-26b-gemma4

Finetuned
(95)
this model
Quantizations
1 model