You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

phi-3.5-moe-instruct-uc-v3-bpw5

Quick start β€” try Sipsa near-lossless 5-bit in 30 seconds

pip install ultracompress
uc try sipsa-qwen3-0.6b      # 30s demo (free, no signup, no GPU)

This calls Sipsa's free inference API on a compressed model β€” see what 1.004x PPL ratio looks like in practice.

To run this model via the API: uc try sipsa-phi-3.5-moe

Browse the full catalog of try-able models: uc catalog

Need this model in production?

Phase 0 POC: $5K / 5 business days / SHA-256 reproducible-reconstruction audit (deterministic decode to the validated artifact).

Contact: founder@sipsalabs.com | sipsalabs.com/poc

Near-lossless 5-bit compression (~1% perplexity; lossy) of microsoft/Phi-3.5-MoE-instruct β€” produced by Sipsa Labs (UltraCompress).

  • Reproducible, cryptographically verifiable reconstruction: a deterministic decode to the SHA-256-pinned validated artifact (not bit-identical to the original bf16 model).
  • Independently perplexity-verified end-to-end.
  • PPL ratio vs bf16: 1.00129Γ— (FineWeb-edu held-out tail, seq_len=1024, seed=42, n=30 prompts).
  • License: BUSL-1.1 + Additional Use Grant.
pip install ultracompress

Documentation & verification: https://sipsalabs.com β€” https://github.com/sipsalabs/ultracompress

Patents pending.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for SipsaLabs/phi-3.5-moe-instruct-uc-v3-bpw5

Finetuned
(6)
this model