ultragemma4-12b-heretic-uncensored

Reasoning-capable language model modified using the Heretic abliteration toolkit

Abliteration 12B Parameters Reasoning Uncensored

ultragemma4-12b-heretic-uncensored is a reasoning-capable language model built on top of google/gemma-4-12B-it and modified using the heretic abliteration toolkit. The model applies refusal-direction analysis and targeted weight-space interventions to reduce internal refusal behaviors while preserving instruction-following, reasoning capabilities, and general conversational performance.

Important

This model is intended strictly for research and learning purposes. Due to reduced internal refusal mechanisms, it may generate sensitive or unrestricted content. Users assume full responsibility for how the model is used. The authors and hosting platform disclaim any liability for generated outputs.

Note

This model is experimental and may generate unexpected behaviors or artifacts in certain scenarios.

Key Highlights

  • Heretic-Based Abliteration: Modified using the Heretic toolkit to identify and alter refusal-related representations within the model.
  • Reduced Refusal Behavior: Optimized to minimize internal refusal tendencies while maintaining instruction-following capabilities.
  • Gemma 4 12B Unified Backbone: Built directly on top of google/gemma-4-12B-it.
  • Multimodal Foundation: Inherits native text, image, audio, and video understanding capabilities from the Gemma 4 Unified architecture.
  • Reasoning-Oriented Performance: Preserves multi-step reasoning and analytical capabilities after abliteration.
  • Research-Focused Release: Designed for alignment research, model behavior analysis, and evaluation of refusal-direction modifications.
  • 12B Scale Deployment: Suitable for local inference, research environments, and optimized deployment setups.

Model Files

File Name Quant Type File Size File Link
ultragemma4-12b-heretic-uncensored.BF16.gguf BF16 23.8 GB Download
ultragemma4-12b-heretic-uncensored.F16.gguf F16 23.8 GB Download
ultragemma4-12b-heretic-uncensored.Q2_K.gguf Q2_K 4.83 GB Download
ultragemma4-12b-heretic-uncensored.Q3_K_L.gguf Q3_K_L 6.57 GB Download
ultragemma4-12b-heretic-uncensored.Q3_K_M.gguf Q3_K_M 6.09 GB Download
ultragemma4-12b-heretic-uncensored.Q3_K_S.gguf Q3_K_S 5.53 GB Download
ultragemma4-12b-heretic-uncensored.Q4_0.gguf Q4_0 6.98 GB Download
ultragemma4-12b-heretic-uncensored.Q4_K_M.gguf Q4_K_M 7.38 GB Download
ultragemma4-12b-heretic-uncensored.Q4_K_S.gguf Q4_K_S 7.02 GB Download
ultragemma4-12b-heretic-uncensored.Q5_0.gguf Q5_0 8.34 GB Download
ultragemma4-12b-heretic-uncensored.Q5_K_M.gguf Q5_K_M 8.55 GB Download
ultragemma4-12b-heretic-uncensored.Q5_K_S.gguf Q5_K_S 8.34 GB Download
ultragemma4-12b-heretic-uncensored.Q6_K.gguf Q6_K 9.79 GB Download
ultragemma4-12b-heretic-uncensored.Q8_0.gguf Q8_0 12.7 GB Download
ultragemma4-12b-heretic-uncensored.mmproj-bf16.gguf mmproj-bf16 175 MB Download
ultragemma4-12b-heretic-uncensored.mmproj-f16.gguf mmproj-f16 175 MB Download
ultragemma4-12b-heretic-uncensored.mmproj-q8_0.gguf mmproj-q8_0 159 MB Download

Intended Use

  • Alignment Research: Studying refusal-direction analysis and behavior modification techniques.
  • Model Evaluation: Benchmarking reasoning, instruction-following, and safety-related behaviors.
  • Red Teaming: Analyzing model responses under reduced-refusal conditions.
  • Local Deployment: Running Gemma 4 Unified models in research and experimentation environments.
  • Abliteration Studies: Exploring the effects of targeted weight-space modifications on model behavior.

Limitations & Risks

Important Note: This model intentionally reduces built-in refusal mechanisms.

  • Sensitive Content Risk: May generate unrestricted, controversial, or unsafe outputs.
  • User Responsibility: Requires careful and ethical use.
  • Experimental Modifications: Behavior may differ significantly from the original model.
  • Alignment Trade-offs: Reduced refusal behavior may impact safety filtering and response constraints.
  • Potential Artifacts: Certain prompts may expose unexpected outputs resulting from the abliteration process.

Acknowledgements

  • google/gemma-4-12B-it: Gemma 4 12B Unified is part of the Gemma 4 family of open models. Built with the same multimodal functionality as Gemma 4 E2B and E4B (text, audio, image, and video inputs), it brings native audio and vision understanding directly to local environments without the need for separate encoders. The model uses the gemma4_unified architecture and supports advanced multimodal reasoning while remaining deployable on consumer hardware.

  • Heretic: Fully automatic censorship removal framework for language models. This project was used to perform the refusal-direction analysis and ablation procedures that form the foundation of this model.

Abliteration Parameters

Parameter Value
direction_index 41.41
attn.o_proj.max_weight 1.48
attn.o_proj.max_weight_position 29.17
attn.o_proj.min_weight 0.38
attn.o_proj.min_weight_distance 24.43
mlp.down_proj.max_weight 1.41
mlp.down_proj.max_weight_position 32.44
mlp.down_proj.min_weight 0.47
mlp.down_proj.min_weight_distance 28.03

Refusal Evaluation

Metric This model Original model (google/gemma-4-12B-it)
Refusals 3/100 98/100

llama.cpp

LLM inference in C/C++ — https://github.com/ggml-org/llama.cpp

license

Gemma 4 [Apache License 2.0] — https://ai.google.dev/gemma/apache_2

Downloads last month
450
GGUF
Model size
12B params
Architecture
gemma4
Hardware compatibility
Log In to add your hardware

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for prithivMLmods/ultragemma4-12b-heretic-uncensored

Quantized
(197)
this model

Collection including prithivMLmods/ultragemma4-12b-heretic-uncensored