SeedVR2-3B (MLX-Swift) β€” int8

int8-quantized MLX-Swift weights for SeedVR2-3B (ByteDance, ICLR 2026) β€” one-step diffusion super-resolution. The on-device variant: ~half the size, near-lossless. For the seedvr2-mlx-swift package (MLXEngine / ForgeUpscaler). fp16 base: SeedVR2-3B-mlx.

  • Quantization: transformer DiT Linears int8, group size 64 (the small vid_in.proj and the VAE are kept fp16). Declared in config.json ("quantization": {"bits": 8, "group_size": 64}); the loader applies it automatically.
  • Files: transformer.safetensors (int8, ~3.9 GB vs 7.9 GB fp16) Β· vae.safetensors (fp16) Β· pos_emb.safetensors Β· config.json.
  • Quality: int8 t_out cosine vs fp16 = 0.9999749 (near-lossless; int4 by contrast degrades this model to ~22.7 dB β€” use int8 on-device). Reload round-trip cosine 1.0.

Usage

import SeedVR2MLX
let upscaler = try SeedVR2Upscaler(directory: weightsDir)   // detects int8 from config, applies quantize on load
let out = upscaler.upscale(processedImage: img, seed: 42)

Provenance & license

ByteDance Seed (SeedVR2, ICLR 2026, ByteDance-Seed/SeedVR, Apache-2.0) β†’ fp16 numz/SeedVR2_comfyUI β†’ MLX ref filipstrand/mflux β†’ MLX-Swift port + int8 conversion by MVS Collective (xocialize). Format/precision-converted weights (not a new model); Apache-2.0 applies.

Downloads last month
-
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Collection including mlx-community/SeedVR2-3B-mlx-int8