Keye-VL-2.0-30B-A3B — Abliterated

An abliterated (refusal-direction-ablated) build of Kwai-Keye/Keye-VL-2.0-30B-A3B, a 30B-A3B vision-language MoE. The vision capability is fully preserved — only the text decoder's attention output projections were modified.

⚠️ Use a CUDA stack (vLLM / SGLang / Transformers + flash-attn). This model uses a custom sparse-attention indexer (SALightningIndexer) built for CUDA kernels (flash-attn, DeepGEMM). On Apple Silicon / MPS, generation via Transformers is unstable for the base model too (not an artifact of abliteration) — for Mac, use the MLX builds in the osmapi collection, which run dense attention.

What was changed

  • Method: Heretic v1.3.0 — automated directional ablation (TPE-optimized over refusal count + KL divergence), 48 trials.
  • Target: attention o_proj of the text decoder, in the mid/late layers where the refusal direction lives (early layers, lm_head, embed_tokens untouched).
  • Best trial: marker-refusals 43→37/48, KL ≈ 0.026 (low → general capability largely retained). This is a moderate abliteration: on this batched-MoE architecture Heretic can only reach o_proj (the experts' FFN down_proj are fused batched tensors it doesn't target), so the MLP path's refusal contribution is not ablated. A stronger custom ablation is possible.

Preserved (verified byte-identical to base)

  • Vision tower visual.vision_model.* (SigLIP encoder + native-resolution packing) — 438 tensors
  • Projector mlp_AR.* (2×2 spatial-merge → text space)
  • lm_head, embed_tokens
  • MTP: the base model has no multi-token-prediction module (nothing to preserve).

Usage (CUDA)

from transformers import pipeline
pipe = pipeline("image-text-to-text",
                model="osmapi/osmKeye-VL-2.0-30B-A3B-uncensored",
                trust_remote_code=True)

Or serve with vLLM / SGLang exactly as the base model.

Provenance / reproducibility

Produced on an Apple M4 Max (128 GB) by abliterating in an isolated venv pinned to transformers==4.57.3 (the model's build version; 5.x removed OutputRecorder). Running this CUDA-built model on macOS required patching out a flash-attn assert, replacing the CUDA fast_hadamard_transform with a pure-torch Walsh–Hadamard transform, and adding AutoModelForImageTextToText to auto_map. Heretic targeted o_proj via a row-normalized LoRA merged into the weights.

Abliteration removes safety alignment; you are responsible for how you use this model.

Other variants of this model (public on osmapi)

Downloads last month
26
Safetensors
Model size
31B params
Tensor type
F32
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for osmapi/osmKeye-VL-2.0-30B-A3B-uncensored

Finetuned
(1)
this model