gary-neuron-emergent
The emergence-trained cortex behind neuron-db โ a 1.13M-parameter, pure-NumPy GPT that learned to read its context window and copy values out of it. It is the optional "thinking" tier that sits on top of the neuron-db store: the store retrieves a small working set, this cortex generates the answer over it.
What it is
- arch: gpt-numpy โ 8 layers, 4 heads, dim 96, 384-token context, vocab 2048
- params: 1,128,384
- trained: step 33,597 (curriculum: copy-from-window + masked answer loss + abstention)
- emergence: in-context QA probe reached 5/6 โ it copies unseen values from the window
(
how many participants? -> 84,512,what is the wifi password? -> vekam73) and learned to abstain ("i don't know right now.") when the answer isn't present. - val: answer-token loss
0.20; chat perplexity held (val_soda 2.2)
Files
cortex.npz float32 weights (P/ keys; gpt-numpy layout)
petite_vocab.json byte-level BPE vocab (2048)
petite_merges.txt BPE merges
config.json E/H/L/BLK, vocab, param count, trained step
Use
from huggingface_hub import snapshot_download
import numpy as np, gpt_numpy as G # gpt_numpy.py from gary-neuron-chat / neuron-db
from tokenizers import ByteLevelBPETokenizer
d = snapshot_download("gary23w/gary-neuron-emergent")
z = np.load(f"{d}/cortex.npz", allow_pickle=True)
P = {k[2:]: z[k] for k in z.files if k.startswith("P/")}
CFG = dict(E=96, H=4, L=8, BLK=384)
tok = ByteLevelBPETokenizer(f"{d}/petite_vocab.json", f"{d}/petite_merges.txt")
# feed it a working set + a question, greedily decode:
ids = tok.encode("U: the launch is on Friday\nG: noted.\nU: when is the launch?\nG:").ids
# ... forward + argmax loop -> "Friday"
Or point neuron-db's bridge at a checkout:
export NEURON_MODEL_DIR=<this download dir>
python -c "from neuron_db.bridge import GaryNeuronBridge; b=GaryNeuronBridge(); ..."
Honest notes
- It was trained on a ~2k everyday-token vocabulary in a
U:/G:fact format. It excels at copying normalized facts out of a bounded window; it is not a general chatbot. - For exact recall you don't need it at all โ neuron-db's store returns the value deterministically. This cortex is for generation/association over the working set.
MIT. Part of the gary-neuron family by gary23w.
- Downloads last month
- 4
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support