https://huggingface.co/AlexWortega/SIQ-1-35B

#2612

by el4 - opened 3 days ago

Discussion

el4

3 days ago

Pulled from readme:

SIQ-1-tiny-35b 🪽

A tiny universal agent — autoresearch, coding, reasoning.

SIQ-1-tiny-35b is a tiny MoE — 35B total but only ~3B active per token — distilled to be a strong
universal agent: equally at home running autonomous ML research (autoresearch), writing and debugging code,
tool-use / agentic workflows, and hard reasoning. Despite its 3B active footprint it matches or beats much
larger peers on core reasoning, sycophancy-resistance, and agentic coding — at a lower token cost.

Benchmark	SIQ-1-tiny-35b	Nex-N2-mini	Qwen3.6-35B
General & Reasoning
GPQA-Diamond (Q4, co-measured)	70.2	67.2	68.2
GPQA-Diamond (bf16, full eval)	90.2	82.6	—
IFEval (inst-loose)	89.5	89.1	—
tok/question (GPQA, mean)	3158 ✅	3363	3500
Agentic coding
vibetest (Claude-judge, /10)	9.21	8.12	—
Ideation (autoresearch)
Opus-judge ideation (/100)	30.2	—	10.2 (base)

bf16 + tuned harness scores higher (90.2 GPQA); the Q4 row is the apples-to-apples co-measured comparison shown
in the figure. Terminal-Bench 2.1 (Harbor, terminus-2, k=5) is in progress.

RichardErkhov

3 days ago

oh it got fixed, right? let me queue again
hopefully it's actually a siq (pun intended) model

It's queued!

You can check for progress at http://hf.tst.eu/status.html or regularly check the model
summary page at https://hf.tst.eu/model#SIQ-1-35B-GGUF for quants to appear.

el4

3 days ago

Tested it- seems to be very siq lol

did very 10 complex tasks in one turn- which is crazy now that i think abt it 🔥

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment