Qwen2.5-3B (SpectralQ 6-bit)

This model contains experimental 6-bit frequency-domain weights quantized using the Transformer Spectral Scaling Law. It utilizes 256-width DCT macroblocks and a 30-bit chunk-major packed layout.
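
As a rough illustration of what a 30-bit packed layout for 6-bit values can look like, the sketch below packs five 6-bit codes into one 30-bit integer and unpacks them again. This is an assumption, not the SpectralQ format: the card does not document the bit order, how chunks map onto stored words, or what "chunk-major" ordering means, and the names `pack_chunk` and `unpack_chunk` are hypothetical.

```python
BITS = 6                         # bits per quantized value (from the model card)
CHUNK_BITS = 30                  # packed chunk width (from the model card)
PER_CHUNK = CHUNK_BITS // BITS   # 5 codes per chunk (assumed interpretation)
MASK = (1 << BITS) - 1           # 0b111111


def pack_chunk(codes):
    """Pack exactly five 6-bit codes into one 30-bit integer.

    Assumption: the first code occupies the lowest 6 bits. The real
    SpectralQ bit order is not documented in this card.
    """
    if len(codes) != PER_CHUNK:
        raise ValueError(f"expected {PER_CHUNK} codes, got {len(codes)}")
    word = 0
    for i, code in enumerate(codes):
        if not 0 <= code <= MASK:
            raise ValueError(f"code {code} does not fit in {BITS} bits")
        word |= code << (i * BITS)
    return word


def unpack_chunk(word):
    """Recover the five 6-bit codes from a 30-bit integer."""
    return [(word >> (i * BITS)) & MASK for i in range(PER_CHUNK)]
```

A 30-bit chunk fits in a 32-bit integer with two bits to spare, which would be consistent with the I32 tensors listed for this repository, but that link is a guess rather than something the card states.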

⚠️ IMPORTANT: HOW TO USE ⚠️

This model cannot be loaded with standard Transformers or vLLM. It requires the custom SpectralQ inference engine, which includes the C++ SRAM bit-unpacking and IDCT projection kernels.

To run this model, please visit the GitHub repository: [👉 SpectralQ 👈]
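
For readers unfamiliar with frequency-domain weights, the following minimal sketch shows the general idea behind an IDCT reconstruction step: a 256-wide block is transformed with a DCT, its coefficients are quantized to 6-bit codes, and an inverse DCT recovers an approximation of the original values. Everything beyond the block width and bit depth is an assumption made for illustration: the orthonormal DCT-II, the per-block uniform scale, and the function names are hypothetical and are not taken from the SpectralQ engine, which implements its kernels in C++.

```python
import math

BLOCK = 256   # macroblock width (from the model card)
LEVELS = 64   # 2**6 codes for 6-bit quantization


def dct2(x):
    """Orthonormal DCT-II of a sequence (O(n^2) reference version)."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out


def idct2(c):
    """Inverse of dct2 (orthonormal DCT-III)."""
    n = len(c)
    out = []
    for i in range(n):
        s = c[0] * math.sqrt(1.0 / n)
        s += sum(
            math.sqrt(2.0 / n) * c[k] * math.cos(math.pi * (i + 0.5) * k / n)
            for k in range(1, n)
        )
        out.append(s)
    return out


def quantize(coeffs):
    """Map coefficients to 6-bit codes with one scale per block (assumed scheme)."""
    peak = max(abs(v) for v in coeffs)
    scale = peak / 31 if peak > 0 else 1.0
    codes = [min(LEVELS - 1, max(0, round(v / scale) + 32)) for v in coeffs]
    return codes, scale


def dequantize(codes, scale):
    """Turn 6-bit codes back into approximate coefficients."""
    return [(q - 32) * scale for q in codes]


if __name__ == "__main__":
    weights = [math.sin(0.1 * i) for i in range(BLOCK)]   # stand-in weight row
    codes, scale = quantize(dct2(weights))
    restored = idct2(dequantize(codes, scale))
    err = max(abs(a - b) for a, b in zip(weights, restored))
    print(f"max reconstruction error: {err:.4f}")
```

Running the script prints the worst-case reconstruction error for a synthetic row. It says nothing about the accuracy of the published weights, which depend on the actual quantizer used.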

Safetensors

Model size: 1B params

Tensor types: F32, I32, BF16, F16, BOOL

Model tree for AUser0/Qwen2.5-3B-SpectralQ-6bit

Base model: Qwen/Qwen2.5-3B