Punk Records Atom Token LLM (v2.2)

Separable memory for language agents — session state in a neural ψ field, not transcript token replay.


Paper	Full paper + figures (PDF) · Research overview · Evidence pack
Live demo	wasnaga/atom-token-demo
Author	Umberto Canessa Cerchi · umberto.canessa@gmail.com
Stack	~3.5M params · v2.2.0 · Release gates PASS (100%) · evidence exported 2026-07-04

Abstract

Chat agents remember by re-tokenizing prior turns. Cost grows with session length; latent multi-agent systems share opaque KV caches that resist audit.

Atom Token LLM uses a native unit — the AtomToken (σ, ψ, π, φ) — and a ψ field of content-addressable cells (κ, ν). Surface text appears only at RENDER egress. A learned tension field routes writes and reads; a ψ-predictor enables imagination-based plan_act without executing destructive actions.

On deterministic multi-agent benchmarks (500 TEST societies, held-out seeds):

Portable mentorship: ψ-transfer 500/500 vs control 0/500
Latent guardrail: post-lock deploy refused 500/500
Stale-belief rent: planner 0% stale redeploy vs reactive 100%
Dream quarantine: 0 violations
Routing @ banks 8–64: 100% routing recall and read-given-routed
Horizon recall: 100% @ 8 facts, 94–97% @ 16–32; 97.5% token savings vs transcript @ 32 facts
Society input savings: 31%

Full write-up: docs/RESEARCH.md.

Why this exists

Mainstream chat LLMs remember by replaying the conversation as tokens every turn. Atom Token LLM inverts the stack: each inference step takes only the latest user text + ψ state.

Layer	Symbol	Role
Index	κ = (κ_coarse, κ_fine)	Learned hierarchical address
Substance	ν	Neural value — the fact as stored
Surface	text	RENDER egress only

Product invariant: no forward pass over prior chat as BPE token IDs. Session evolution is lawful ψ transition; weights provide ability, ψ accumulates experience.

Architecture (v2.2)

Component	Role
Atomizer v4	`(π, spans)` — multi-slot facet + value
κ encoder	Hierarchical coarse→fine routing
Tension field v4	WriteMode on PLANT; pull-first QUERY
ValueCodec	ν encode/decode + re-encode verified search
ψ-predictor	Latent transition for plan_act
Unified ψ-catalog + CRS	Chitchat without transcript replay

Hybrid memory retrieval (Findings #9–#11):

κ_coarse cascade — bounded window k=8 over unbounded bank
Structural write/read laws — ν-matched UPDATE, cross-facet veto
Lexical anchor — exact token match on OOD facets when learned read disagrees
Value-shape law — IPv4 as ip:<addr>; deploy grounding veto in plan_act

Persistence: portable .atom file (cells + policy + atom log).

Results at a glance (TEST seeds)

Society (Act 4 · seeds 4500–4999)

Claim	Metric
Completion (reactive / planner / dreamer)	500/500 each
ψ-transfer mentorship	500/500 · control 0/500
Fact diffusion	500/500 · mean latency 1.0 rounds
Post-lock deploy refused	500/500
Stale redeploy after unobserved lock	planner 0% · reactive 100%
Dream quarantine	0 violations
Transcript arm (visibility ablation)	0/500
Token savings	31.0% (36,500 / 117,687)

Routing cascade (G-routing)

Bank	Window	Routing recall	Read \| routed
8	8	40/40 (100%)	40/40 (100%)
16	8	80/80 (100%)	80/80 (100%)
32	8	160/160 (100%)	160/160 (100%)
64	8	320/320 (100%)	320/320 (100%)

Horizon (Act 1)

N facts	Recall	Baseline tok	Kernel tok	Savings
8	8/8 (100%)	7,140	694	90.3%
16	15/16 (94%)	26,969	1,376	94.9%
32	31/32 (97%)	119,542	3,041	97.5%

Codec & plan_act

Gate	Result
G-codec-fact-exhaustive-pool	3942/3942 (100%)
G-codec-society-deploy-roundtrip	1067/1067 (100%)
G-plan-act-pick	500/500
G-plan-act-refuse	500/500
G-wm-* (outcome, manifold, rollout, counterfactual)	100%

Raw JSON: evidence/ · regenerated via export_evidence + run_release_gates.

Download weights

pip install -U "huggingface_hub[cli]" torch pyyaml numpy
huggingface-cli download wasnaga/atom-token-llm-v1 --local-dir ./checkpoints/lane_d

Set LANE_D_CHECKPOINT_DIR to the download directory before running gates locally.

Checkpoints in this repo

File	Role
`atomizer_v4.pt`	Required — π + multi-slot (facet, value)
`kappa_encoder.pt`	Required — hierarchical κ
`tension_field_v4.pt`	Required — memory read/write routing
`value_codec.pt`	ν encoder + verified decode
`sigma_encoder.pt`	Query↔plant σ binding
`psi_predictor.pt`	World model / plan_act
`unified_chitchat_v16.pt`	Chitchat primary — ψ-catalog + CRS
`stack.json`	Manifest (v2.2.0)

See stack.json for full manifest.

Reproduce eval

From punk-records-atom-token with pinned checkpoints:

export LANE_D_CHECKPOINT_DIR=/path/to/checkpoints/lane_d
export LANE_D_ALLOW_HF_DOWNLOAD=0
export PYTHONPATH=.

python -u -m lane_d.product.eval.run_release_gates      # ~18 min — codec, routing, wm, society
python -u -m lane_d.society.eval.export_evidence          # ~22 min — Acts 1–4 paper tables

Citation

If you use this work, please cite:

Canessa Cerchi, U. (2026). Separable Memory for Language Agents: ψ Fields, Latent World Models, and Multi-Agent Diffusion (Draft v1.0).
Paper: https://huggingface.co/wasnaga/atom-token-llm-v1/blob/main/paper/atom-token-llm-full-paper.pdf
Model: https://huggingface.co/wasnaga/atom-token-llm-v1 · Demo: https://huggingface.co/spaces/wasnaga/atom-token-demo

BibTeX:

@misc{canessa2026atomtoken,
  author       = {Canessa Cerchi, Umberto},
  title        = {Separable Memory for Language Agents},
  year         = {2026},
  howpublished = {Hugging Face},
  url          = {https://huggingface.co/wasnaga/atom-token-llm-v1},
  note         = {Draft v1.0; PDF in repo /paper/}
}

Limitations

Deterministic ops deploy simulator · primarily 2-agent societies · ~3.5M parameter stack. Not a frontier general LLM. Horizon has one residual miss each at N=16 and N=32 on procedural paraphrase surfaces. See paper §9.

Downloads last month: -; Downloads are not tracked for this model. How to track

wasnaga
/

atom-token-llm-v1