Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
z-lab 's Collections
DFlash
ParoQuant

ParoQuant

updated about 7 hours ago

Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Upvote
4

  • ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

    Paper • 2511.10645 • Published Nov 13, 2025 • 6

  • z-lab/Qwen3.5-4B-PARO

    1B • Updated 2 days ago • 121 • 1

  • z-lab/Qwen3.5-0.8B-PARO

    Image-Text-to-Text • 0.4B • Updated 2 days ago • 265

  • z-lab/Qwen3.5-2B-PARO

    1B • Updated 2 days ago • 34

  • z-lab/Qwen3.5-9B-PARO

    3B • Updated 2 days ago • 24

  • z-lab/Qwen3-8B-PARO

    Text Generation • 1B • Updated 1 day ago • 1.16k

  • z-lab/Qwen3-4B-PARO

    Text Generation • 0.9B • Updated 1 day ago • 346

  • z-lab/Qwen3-0.6B-PARO

    0.2B • Updated 2 days ago • 196

  • z-lab/Qwen3-1.7B-PARO

    Text Generation • 0.5B • Updated 1 day ago • 98

  • z-lab/Qwen3-14B-PARO

    Text Generation • 2B • Updated 1 day ago • 110

  • z-lab/Llama-3.1-8B-Instruct-PARO

    Text Generation • 1B • Updated 1 day ago • 19

  • z-lab/Meta-Llama-3-8B-PARO

    Text Generation • 1B • Updated about 7 hours ago • 3

  • z-lab/Llama-2-7b-hf-PARO

    Text Generation • 1B • Updated 1 day ago • 18

  • z-lab/DeepSeek-R1-Distill-Llama-8B-PARO

    Text Generation • 1B • Updated about 7 hours ago • 1

  • z-lab/paroquant-checkpoints

    Updated 2 days ago
Upvote
4
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs