SCAIL-2 GGUFs (by Rebel AI)
GGUF quantizations of SCAIL-2, the end-to-end character-animation / video motion-transfer model (Wan 2.1 14B backbone) from zai-org. These run the SCAIL-2 DiT in ComfyUI at a fraction of the VRAM the full fp16/fp8 weights require.
Quantized by RealRebelAI · GitHub · YouTube
Load with the GGUF Unet Loader (city96's ComfyUI-GGUF → Unet Loader (GGUF)). Place the `.gguf` in `ComfyUI/models/unet/`.
Quant tiers
| Tier | Approx size | Notes |
|---|---|---|
| Q2_K | ~6 GB | Smallest; runs on minimal VRAM, expect quality loss |
| Q3_K_M | ~8.1 GB | Budget tier, better coherence than Q2 |
| Q4_K_M | ~10 GB | Recommended daily driver |
| Q5_K_M | ~12 GB | Sweet spot above Q4 |
| Q6_K | ~14 GB | Higher fidelity |
| Q8_0 | ~17 GB | Closest to fp16 |
The loader memory-maps the model, so a larger file costs disk and streaming time, not resident RAM.
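As a rough way to read the table, the sketch below picks the largest tier whose file fits a given memory budget. It treats file size as a proxy for the memory the weights need, and the `headroom_gb` default is a made-up allowance for activations and the other models in the workflow; both are assumptions, not measurements from this repo.

```python
# Approximate file sizes in GB, copied from the quant tier table.
TIER_SIZES_GB = {
    "Q2_K": 6.0,
    "Q3_K_M": 8.1,
    "Q4_K_M": 10.0,
    "Q5_K_M": 12.0,
    "Q6_K": 14.0,
    "Q8_0": 17.0,
}


def pick_tier(budget_gb, headroom_gb=2.0):
    """Return the largest tier that fits in budget_gb minus headroom_gb, or None.

    headroom_gb is a hypothetical safety margin, not a value from the model card.
    """
    usable = budget_gb - headroom_gb
    fitting = [(size, name) for name, size in TIER_SIZES_GB.items() if size <= usable]
    return max(fitting)[1] if fitting else None


if __name__ == "__main__":
    for budget in (8, 12, 16, 24):
        print(budget, "GB ->", pick_tier(budget))
```

Treat the result as a starting point only; actual usage depends on resolution, frame count, and offloading settings.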
Required files (NOT included in this repo)
Download each of these separately and place them in the listed ComfyUI folder.
Text Encoder
ComfyUI/models/text_encoders/
https://huggingface.co/Kijai/WanVideo_comfy/blob/main/umt5-xxl-enc-fp8_e4m3fn.safetensors
LoRA (LightX2V step/cfg distill)
ComfyUI/models/loras/
https://huggingface.co/lightx2v/Wan2.1-I2V-14B-480P-StepDistill-CfgDistill-Lightx2v/blob/main/loras/Wan21_I2V_14B_lightx2v_cfg_step_distill_lora_rank64.safetensors
SAM 3.1 Multiplex
ComfyUI/models/sam/
https://huggingface.co/Comfy-Org/sam3.1/blob/main/checkpoints/sam3.1_multiplex_fp16.safetensors
CLIP Vision
ComfyUI/models/clip_vision/
https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/clip_vision/clip_vision_h.safetensors
VAE
ComfyUI/models/vae/
https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/vae/wan_2.1_vae.safetensors
Optional: SCAIL-2 DPO LoRA (untested)
ComfyUI/models/loras/
https://huggingface.co/Comfy-Org/SCAIL-2/blob/main/loras/wan2.1_SCAIL_2_DPO_lora_bf16.safetensors
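The links above are Hugging Face `/blob/` page URLs; the direct-download form of the same file uses `/resolve/` in place of `/blob/`. The sketch below, offered as one possible way to script the step, turns each link into a direct URL and the local path it should land at. The helper names `resolve_url` and `destination` are illustrative, and the script only prints a plan; fetching the files is left to whatever download tool you prefer.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

HF = "https://huggingface.co/"

# (target folder, page URL) pairs copied from the list above.
REQUIRED = [
    ("ComfyUI/models/text_encoders",
     HF + "Kijai/WanVideo_comfy/blob/main/umt5-xxl-enc-fp8_e4m3fn.safetensors"),
    ("ComfyUI/models/loras",
     HF + "lightx2v/Wan2.1-I2V-14B-480P-StepDistill-CfgDistill-Lightx2v/blob/main/"
          "loras/Wan21_I2V_14B_lightx2v_cfg_step_distill_lora_rank64.safetensors"),
    ("ComfyUI/models/sam",
     HF + "Comfy-Org/sam3.1/blob/main/checkpoints/sam3.1_multiplex_fp16.safetensors"),
    ("ComfyUI/models/clip_vision",
     HF + "Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/"
          "clip_vision/clip_vision_h.safetensors"),
    ("ComfyUI/models/vae",
     HF + "Comfy-Org/Wan_2.1_ComfyUI_repackaged/blob/main/split_files/"
          "vae/wan_2.1_vae.safetensors"),
]


def resolve_url(blob_url):
    """Convert a Hugging Face page URL (/blob/) to its direct-download form (/resolve/)."""
    return blob_url.replace("/blob/", "/resolve/", 1)


def destination(folder, url):
    """Return folder joined with the file name taken from the URL path."""
    return PurePosixPath(folder) / PurePosixPath(urlparse(url).path).name


if __name__ == "__main__":
    for folder, url in REQUIRED:
        print(resolve_url(url))
        print("  ->", destination(folder, url))
```

The optional DPO LoRA can be added to `REQUIRED` the same way, with `ComfyUI/models/loras` as its folder.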
Folder structure
ComfyUI/models/
├── unet/
│   └── SCAIL-2-Q4_K_M.gguf (from this repo)
├── text_encoders/
│   └── umt5-xxl-enc-fp8_e4m3fn.safetensors
├── loras/
│   ├── Wan21_I2V_14B_lightx2v_cfg_step_distill_lora_rank64.safetensors
│   └── wan2.1_SCAIL_2_DPO_lora_bf16.safetensors (optional)
├── sam/
│   └── sam3.1_multiplex_fp16.safetensors
├── clip_vision/
│   └── clip_vision_h.safetensors
└── vae/
    └── wan_2.1_vae.safetensors
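To confirm the layout before opening the workflow, a small check like the following can list anything that is missing. It is a minimal sketch that assumes the Q4_K_M quant and a ComfyUI install at `ComfyUI/` relative to the current directory; adjust both to your setup. The optional DPO LoRA is deliberately left out of the list.

```python
from pathlib import Path

# Paths relative to ComfyUI/models/, taken from the folder structure above.
# The unet entry assumes the Q4_K_M tier; swap in the quant you downloaded.
EXPECTED = [
    "unet/SCAIL-2-Q4_K_M.gguf",
    "text_encoders/umt5-xxl-enc-fp8_e4m3fn.safetensors",
    "loras/Wan21_I2V_14B_lightx2v_cfg_step_distill_lora_rank64.safetensors",
    "sam/sam3.1_multiplex_fp16.safetensors",
    "clip_vision/clip_vision_h.safetensors",
    "vae/wan_2.1_vae.safetensors",
]


def missing_files(models_dir):
    """Return the expected relative paths that do not exist under models_dir."""
    root = Path(models_dir)
    return [rel for rel in EXPECTED if not (root / rel).is_file()]


if __name__ == "__main__":
    missing = missing_files("ComfyUI/models")
    if missing:
        for rel in missing:
            print("missing:", rel)
    else:
        print("all required files found")
```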
Notes
- `WEIGHT NOT MERGED` warning on `patch_embedding` is harmless. ComfyUI builds a 36-channel patch embedding and concatenates the mask channels at runtime; the model fills them internally. The stored 20-channel weight is expected. Generation proceeds normally.
- The colored mask is a required input even in single-character Animation Mode; don't remove it from the workflow.
- Set width and height explicitly (both divisible by 16; 832×480 is a good 480p start).
- The `SCAIL2ColoredMask` node may require a recent / nightly ComfyUI build.
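For the divisible-by-16 rule in the notes above, a helper along these lines can snap an arbitrary size to the nearest valid one below it. This is an illustrative sketch; the function names are hypothetical and rounding down (rather than to the nearest multiple) is a choice made here, not something the workflow mandates.

```python
def snap_to_16(value):
    """Round value down to a multiple of 16, with a floor of 16."""
    return max(16, (int(value) // 16) * 16)


def valid_size(width, height):
    """Return True when both dimensions are positive multiples of 16."""
    return width > 0 and height > 0 and width % 16 == 0 and height % 16 == 0


if __name__ == "__main__":
    width, height = snap_to_16(850), snap_to_16(490)
    print(width, height, valid_size(width, height))
```

The suggested 832×480 starting size already satisfies the check, so it passes through unchanged.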
Credits
- Model: zai-org / SCAIL-2
- GGUF quantization: RealRebelAI
- GGUF tooling: city96 / ComfyUI-GGUF