Boogu-Image-0.1-Base GGUF

Required Text Encoder & VAE

To run the Boogu architecture properly, you cannot use standard SD or Flux encoders. You must download the specific VAE and multimodal text encoders.

  • Text Encoder (Qwen3-VL): You must use an FP8 scaled version of the Qwen3-VL encoder.
  • VAE (Flux): The Boogu pipeline utilizes the standard Flux VAE.

⚠️ Why are there no Q2 or Q3 K-Quants?

Currently, this repo only provides Flat Quants (Q4_0, Q4_1, Q5_0, Q5_1, Q8_0).

Standard K-quants (like Q2_K or Q3_K_M) require a hardcoded architectural mapping blueprint inside the llama.cpp source code. Because the Boogu/OmniGen architecture is brand new, those K-quant blueprints do not exist in the compiler yet. Flat quants bypass this requirement by forcing all 2D tensors to the target bit-depth.

If you are running an 8GB VRAM setup (like an RTX 3070), the Q4_0 is the recommended sweet spot for VRAM savings and quality.

How to use in ComfyUI

Prerequisite: Core Update Override Native support for the Boogu/OmniGen architecture has been merged from Pull Request #14523. If your Load Clip node doesnt have the Boogu architecture, you must manually fetch the PR into your ComfyUI installation.

  1. Open a command prompt directly inside your ComfyUI folder.
  2. Run the following commands to fetch and switch to the PR branch:
   git fetch origin pull/14523/head:boogu-pr
   git checkout boogu-pr
Downloads last month
929
GGUF
Model size
10B params
Architecture
lumina2
Hardware compatibility
Log In to add your hardware

4-bit

5-bit

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for realrebelai/Boogu-Image-Base_GGUFs

Quantized
(1)
this model