Qwen3-1.7B-RYS-7-10

RYS-enhanced Qwen3-1.7B with layers 7-10 duplicated. 28 layers expanded to 31. Zero training, zero weight changes.

Math improvement: +9.1% with near-stable reasoning.

Results

Metric Baseline RYS (7,10) Delta
Math 0.4626 0.5540 +9.1%
EQ 82.77 83.71 +0.94
Reasoning 82.35% 76.47% -6%

The daily driver. Nearly 10% math improvement from a 1.2GB model.

Usage

llama-server -m Qwen3-1.7B-RYS-7-10-Q4_K_M.gguf -ngl 99

Full sweep data

51 configurations tested. Sweep results published with the model files.

Part of the v2 Qwen3-family cohort โ€” parallel to v1 Qwen2.5 cross-scale; a Qwen3-only expansion from April 2026.

Where this sits in the Sovereign Collection

v1 โ€” Qwen2.5 cross-scale + Qwen3-32B headline crossover (the original v1 intent per the 2026-04-11 writeup). 5 model repos on HuggingFace; see john-broadway.

v2 Qwen3-family cohort (this card's cohort โ€” parallel Qwen3-family RYS-applied weights, April 2026):

v2 cross-architecture corpus (21 model variants spanning 10 architecture families): john-broadway/rys-sovereign-collection-v2

Attribution: John Broadway, with collaboration from Claude (Opus 4.6 in April 2026 build; Opus 4.7 in May 2026 cross-architecture analysis and family-relabeling). Original RYS method by David Ng on Qwen2-72B; sweep toolkit by alainnothere.


v2 cross-architecture context (2026-05-13)

This model's place in the v2 curve: baseline reasoning 82.35%, peak RYS ฮ” +11.76%. The (7,10) configuration is in the early-mid layer band typical of 28-layer architectures.

Across the 21 model variants (10 architecture families) surveyed in john-broadway/rys-sovereign-collection-v2:

  • Pearson r(baseline reasoning, peak RYS lift) = โˆ’0.726. Weak baselines lift more, in their weakest dimension.
  • Three RYS-recoverable suppression mechanisms identified: under-training scale, MoE routing inefficiency, specialization training trade-off.
  • One published negative result (SmolLM2-1.7B). RYS is not universal.

v2 attribution: John Broadway, with cross-architecture analysis by Claude (Opus 4.7). Original RYS method by David Ng; circuit-finder toolkit by alainnothere.

Downloads last month
202
GGUF
Model size
2B params
Architecture
qwen3
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for john-broadway/Qwen3-1.7B-RYS-7-10-GGUF

Finetuned
Qwen/Qwen3-1.7B
Quantized
(268)
this model

Collection including john-broadway/Qwen3-1.7B-RYS-7-10-GGUF