Qwen2.5-7B-RYS-8-12

Qwen2.5-7B-Instruct with layers 8-12 duplicated. The math circuit runs twice on every forward pass.

28 base layers β†’ 32 after duplication. No training, no merging, no weight changes.

Math +10% (0.5653 β†’ 0.6645). EQ +1.0. Reasoning held at 94.12%.

Results

Metric Baseline RYS (8,12) Delta
Math 0.5653 0.6645 +10%
EQ 89.69 90.66 +0.97
Reasoning 94.12% 94.12% 0.00

The math specialist. The first sub-14B model the method was extended to. David Ng demonstrated layer-duplication on Qwen2-72B; we found it works just as cleanly here, 10Γ— smaller. Of 51 swept configurations, (8,12) is the one that gets the math circuit alone β€” duplicate that 4-layer block and nothing else trades down. The original GitHub-issue writeup describing exactly this result lives in the v2 corpus: docs/github-issue-draft.md.

Usage

llama-server -m Qwen2.5-7B-RYS-8-12-Q4_K_M.gguf -ngl 99

Full sweep data

51 configurations tested. Full sweep data in the v2 corpus dataset. For vLLM-serving with AWQ quantization, see companion repo john-broadway/Qwen2.5-7B-RYS-8-12-AWQ.

Part of the RYS Sovereign Collection v1.


Where this sits in the Sovereign Collection

v1 β€” Qwen2.5 cross-scale + Qwen3-32B headline. Four sizes from 0.5B to 32B; RYS works at every scale, with the lift size and dimension shifting by baseline:

  • 0.5B β†’ EQ specialist
  • 1.5B β†’ balanced daily driver
  • 7B β†’ math specialist via (8,12)
  • 32B β†’ the headline "Big Boy"

v2 β€” cross-architecture extension. 21 model variants across 10 architecture families. Headline: weak baselines lift more, in their weakest dimension. β†’ john-broadway/rys-sovereign-collection-v2

Credit

John Broadway, with collaboration from Claude (Opus 4.6 in April 2026 build; Opus 4.7 in May 2026 analysis and publication). Original RYS method by David Ng on Qwen2-72B; sweep toolkit by alainnothere.

Downloads last month
136
GGUF
Model size
9B params
Architecture
qwen2
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for john-broadway/Qwen2.5-7B-RYS-8-12-GGUF

Base model

Qwen/Qwen2.5-7B
Quantized
(311)
this model

Collection including john-broadway/Qwen2.5-7B-RYS-8-12-GGUF