Macron T2 32K

Mobile GGUF release for the Galvaniq Android app.

Files

  • Macron_T2_32K-i2s-f16.gguf
  • Macron_T2_32K-i2s-f16-meta.json

Build

  • Runtime: bitnet.cpp
  • Format: I2_S ternary linear tensors with F16 token embedding
  • Context length: 32768
  • RoPE: YaRN, factor 8.0, original context 4096, theta 500000
  • Size: 1187801440 bytes
  • SHA256: e7901384bdfc226018c430fa9e5ea88051a682d06c02e33f67cb974f1a046b66
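
A downloaded copy can be checked against the size and SHA256 listed above. The following is a minimal sketch using only the Python standard library; the helper name verify_file and the chunk size are illustrative assumptions, not part of the release tooling.

```python
import hashlib
import os

# Values copied from the Build list above.
EXPECTED_SIZE = 1187801440
EXPECTED_SHA256 = "e7901384bdfc226018c430fa9e5ea88051a682d06c02e33f67cb974f1a046b66"


def verify_file(path, expected_size, expected_sha256, chunk_size=1 << 20):
    """Return True only if both the byte size and the SHA-256 digest match.

    verify_file is a hypothetical helper written for this example.
    """
    # Cheap check first: a size mismatch means hashing is unnecessary.
    if os.path.getsize(path) != expected_size:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in chunks so a file of over 1 GB is never held in memory at once.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_sha256.lower()

# Example call, assuming the file sits in the current directory:
# verify_file("Macron_T2_32K-i2s-f16.gguf", EXPECTED_SIZE, EXPECTED_SHA256)
```

Checking the size before hashing keeps the failure path fast for truncated downloads, which are the most common problem with large files on mobile connections.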

Verification

  • Tensor gate: 210 I2_S tensors, 0 Q8_0 tensors
  • token_embd.weight: F16
  • output.weight: absent
  • GGUF metadata includes bitnet-25.context_length = 32768
  • Modal smoke test loaded successfully with n_ctx = 32768
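
The gate conditions above can be expressed as a small check. This is a sketch only: it assumes the tensor names and type names have already been dumped into a plain dict by some GGUF reader (the dump step is not shown, and the dict layout is a hypothetical input format), and it assumes the usual YaRN relation that the extended context equals the original context multiplied by the scaling factor.

```python
from collections import Counter


def tensor_gate(tensor_types):
    """Summarize the gate conditions from the Verification list.

    tensor_types: dict mapping tensor name -> type name string,
    e.g. {"token_embd.weight": "F16", ...} (hypothetical input format).
    """
    counts = Counter(tensor_types.values())
    return {
        "i2s_count": counts.get("I2_S", 0),
        "q8_0_count": counts.get("Q8_0", 0),
        "token_embd_is_f16": tensor_types.get("token_embd.weight") == "F16",
        "output_weight_absent": "output.weight" not in tensor_types,
    }


def gate_passes(report, expected_i2s=210):
    """True when the report matches the numbers stated for this release."""
    return (
        report["i2s_count"] == expected_i2s
        and report["q8_0_count"] == 0
        and report["token_embd_is_f16"]
        and report["output_weight_absent"]
    )


def extended_context(original_ctx, yarn_factor):
    """Context length implied by YaRN scaling (assumed: original * factor)."""
    return int(original_ctx * yarn_factor)


# Sanity check of the Build values: 4096 * 8.0 gives the stated 32768.
assert extended_context(4096, 8.0) == 32768
```

Keeping the report as a dict, rather than returning a single boolean, makes it easier to see which condition failed when a build does not pass the gate.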

This release targets the Galvaniq app's BitNet runtime path. It makes no claim of compatibility with generic llama.cpp builds.

Model details

  • File format: GGUF
  • Model size: 2B params
  • Architecture: bitnet-25