Qwen3.5-27B-metro-v24

QLoRA fine-tune of Qwen3.5-27B for the MetroLLM-Bench transit-kiosk task. v24 is the leakage-free retraining used in the MetroLLM-Bench paper (teacher traces drawn only from the 717-case training partition; 238 cases held out). Supersedes continker/Qwen3.5-27B-metro-v23.

Held-out results (n=238, mean of 2 seeds)

| Metric | 27B base | 27B + v24 | Δ |
|---|---|---|---|
| Tier-1 | 92.32 | 91.41 | -0.91 |
| Composite | 90.60 | 89.72 | -0.88 |

At 27B, PEFT regresses versus the base model by about 0.9 points on both Tier-1 and Composite, the negative end of the capacity-ceiling curve reported in the paper. This adapter is published for completeness and to reproduce that result. For deployment at this size, use the base Qwen3.5-27B; for an efficient fine-tuned student, use the 4B or 9B v24 models, which gain over their bases.

Contents

  • adapter/ — LoRA adapter (rank 16, α 32; QLoRA 4-bit NF4) + tokenizer + chat template
  • Qwen3.5-27B-metro-v24-Q4_K_M.gguf — merged GGUF (16 GB)
  • training_summary.json

The LoRA adapter keys carry the `.language_model.` prefix; strip it before loading the adapter onto text-only Qwen3.5-27B. License: Apache 2.0.
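
The key renaming mentioned above can be sketched as follows. This is a minimal illustration, not the author's tooling: it assumes that "stripping" means collapsing the `.language_model.` segment to a single `.`, and the helper name `strip_language_model_prefix` and the example key names are hypothetical.

```python
from typing import Dict, TypeVar

T = TypeVar("T")

# Segment named in the model card; collapsing it to "." is an assumption.
PREFIX = ".language_model."


def strip_language_model_prefix(state_dict: Dict[str, T]) -> Dict[str, T]:
    """Return a copy of state_dict with the first '.language_model.' in each key collapsed to '.'."""
    renamed: Dict[str, T] = {}
    for key, tensor in state_dict.items():
        new_key = key.replace(PREFIX, ".", 1)
        # Guard against two source keys mapping to the same target key.
        if new_key in renamed:
            raise ValueError(f"key collision after renaming: {new_key}")
        renamed[new_key] = tensor
    return renamed


# Hypothetical key names for illustration only; the real adapter's keys may differ.
example = {
    "base_model.model.model.language_model.layers.0.self_attn.q_proj.lora_A.weight": "A0",
    "base_model.model.model.language_model.layers.0.self_attn.q_proj.lora_B.weight": "B0",
}
for name in strip_language_model_prefix(example):
    print(name)
```

If the adapter weights are stored as a safetensors file, the tensors could be read with `safetensors.torch.load_file`, passed through this helper, and written back with `safetensors.torch.save_file` before loading the adapter; the exact file name inside `adapter/` should be checked rather than assumed.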
