Antaḥkaraṇa-Net

🔗 Model & code: 🤗 huggingface.co/deepakdsoni/antahkarana-v1 · github.com/deepaksatna/antahkarana

A working AI architecture built on the 2,500-year-old Vedic / Sanskrit model of mind — the antaḥkaraṇa ("inner instrument"). One agent that learns continually without forgetting, scales its effort and mood to its own state, perceives without hallucinating, runs embodied and even on a spiking (neuromorphic-style) substrate — all validated on real hardware. A deep-research proof-of-concept: the foundation is built and measured honestly; scaling it is the next chapter.

📦 Model family

Model	What
antahkarana-v1	the original architecture + v1 vision models — the most stable continual learner (only positive backward transfer)
antahkarana-v2	accuracy-recovering v2 (36.5M) — matches SOTA accuracy at ~3× less forgetting
antahkarana-7B	the architecture scaled to a 7B language model

🔬 Scale-Up Benchmark & Models

The original POC is now scaled to real WideResNets (36.5M–52.6M params) and benchmarked by live inference on the trained checkpoints — 11 models, 7 capabilities each, on a single NVIDIA A10. Full report with methodology and per-model tables: BENCHMARK_REPORT.md. End-to-end live test on the published model: E2E_TEST_REPORT.md (run it: python3 e2e_demo.py).

Models in this repo

File	Params	Dataset / setup	Forgetting ↓
`antahkarana-36.5M-cifar100-wrn28-10.pt`	36.5M	CIFAR-100, 10-task (primary)	0.565→0.018 (31.8×)
`antahkarana-52.6M-cifar100-wrn28-12.pt`	52.6M	CIFAR-100, 10-task	0.542→0.021 (25.4×)
`antahkarana-36.5M-tinyimagenet-wrn28-10.pt`	36.5M	Tiny-ImageNet, 10-task	0.503→0.017 (29.2×)
`antahkarana-36.5M-cifar100-20task-wrn28-10.pt`	36.5M	CIFAR-100, 20-task lifelong	0.603→0.049 (12.3×)

Load any with the self-contained load_akn.py (only needs PyTorch):

from load_akn import load
model, ck = load("antahkarana-36.5M-cifar100-wrn28-10.pt")
logits = model(x, task=0)        # x: (N, 3, 32, 32)
print(ck["results"]["agent"])

Catastrophic forgetting — cut 12–41× vs a naive baseline

Two model sizes, two datasets, 10- and 20-task streams. The agent forgets almost nothing (0.01–0.05); the naive net collapses to its last task.

Capability scorecard — 10 of 11 models pass 7/7

The single 6/7 (āśrama_s0) is a borderline threshold artifact on avg-accuracy (0.592 vs the 0.60 = 3×-chance bar); its memory/abstention/calibration all pass.

Per-task retention — nothing collapses

Final accuracy of every task after the full stream (10-task and 20-task). All stay well above chance — the model remembers task 0 after learning task 19.

Anti-hallucination (pramāṇa) — abstains instead of guessing

Calibrated abstention: gated accuracy 0.91–1.00 when it commits; abstains on up to 99.7% of out-of-distribution (SVHN) inputs.

Legible mind-state — plasticity falls across the four āśrama life-stages

The 20-task lifelong run: plasticity headroom drops childhood→old-age (brāhmacarya→gṛhastha→vānaprastha→saṃnyāsa) while forgetting stays low and bounded.

Headline (5-/2-seed means): forgetting 0.589→0.0146 (~41×) at 36.5M; 27× at 52.6M; 29× on Tiny-ImageNet; 12.5× on the 20-task lifelong run. Gated accuracy 0.93–0.96, calibration ECE cut ~5–7×, OOD abstention up to 99.7% — all by live inference on the released checkpoints.

1. What it is, in one breath

Modern AI already has the pieces of a mind — attention, memory, decision, control — but no principled way to wire them into one self-regulating, lifelong-learning whole. The Vedic model of mind is exactly such a wiring diagram. Antaḥkaraṇa-Net implements it: every Sanskrit faculty becomes a real ML module, assembled into a single agent.

Sanskrit faculty	What it does	ML module
manas (मनस्)	attention / perception gate	precision-weighted attention encoder
buddhi (बुद्धि)	discrimination, decision	evidence-accumulation / executive
ahaṃkāra (अहंकार)	the "I-maker" / self-model	identity latent
chitta (चित्त)	memory & the subconscious	continual memory (EWC + decay)
guṇas (सत्त्व·रजस्·तमस्)	the three qualities	one controller of plasticity / explore / consolidate
tapas (तपस्)	concentrated effort	effort-allocation by need
divya-dṛṣṭi / pramāṇa	valid extended perception	calibrated abstention gate (anti-hallucination)
turīya (तुरीय)	the witness	reward-invariant identity monitor
āśrama	life-stages	lifelong plasticity schedule (childhood → old age)

2. Why — the motivation

Two motivations meet here.

(a) The Vedic psychology is a stunningly good systems diagram of mind. The Upaniṣads, Sāṃkhya and Yoga decompose cognition into a four-fold inner instrument, separate awareness from processing (the "hard problem", 2,000 years early), give a four-state model of consciousness (waking / dream / deep-sleep / turīya), and a real theory of the subconscious (saṃskāra / vāsanā). It even contains a developmental law — the āśramas — for how a mind should keep improving across a whole lifetime. (The full study is in philosophy/: texts & mantras, the modern-neuroscience cross-walk, the Sanskrit formulae, the modern equations, and the architecture derivation.)

(b) Today's AI has matching blind spots — and the Vedic model addresses each one:

Limitation of today's models	Antaḥkaraṇa-Net's structural answer
No continual learning (frozen after training)	chitta: incremental updates, never retrain from zero
Catastrophic forgetting	saṃskāra importance with growth and decay
Full-retrain energy cost (GWh)	update + sleep-time consolidation; spiking substrate
Dense, always-full-power compute	guṇa-scaled effort; event-driven spikes
No self-monitoring across modes	turīya reward-invariant monitor
Confident hallucination	pramāṇa validity gate (abstain, don't confabulate)
Disembodied (no action→consequence)	karma loop + battery→guṇa in the embodied agent

3. How it works — the architecture

The backbone thinks; the Vedic layer remembers, regulates, perceives, and monitors around it:

Perception → decision pipeline: manas (attention) → chitta (memory) → buddhi (decision) ↔ ahaṃkāra (self).
One guṇa controller turns a 3-vector (sattva, rajas, tamas) into all the learning dynamics (plasticity, exploration, consolidation, pruning) — and it is forgetting-aware (protect hard tasks, back off on easy ones) and, when embodied, driven by the battery (low battery → tamas → conserve).
Four operating states: jāgrat (wake/act) → svapna (dream/replay) → suṣupti (sleep/consolidate).
Two safety overlays: the pramāṇa gate (extended perception must be valid knowledge, not fancy) and the turīya witness (a reward-invariant identity monitor).
The āśrama schedule keeps plasticity non-zero for life and re-opens critical periods on novelty — the "always enhanceable, childhood → old age" property.

Because the control layer is backbone-agnostic, the same agent runs on a toy MLP, a real CNN, an RL policy, or a spiking net — which is why embodiment and neuromorphic are extensions of one model, not separate builds.

4. What we achieved — results (all from real runs)

Phase	Result	Status
Integration	one agent, all faculties in a single wake/dream/sleep loop	✅
Phase II — real CNN + data (GPU)	Split-CIFAR: forgetting 0.217 → 0.008 (~27×); +9–10 pts accuracy	✅
Forgetting-aware controller	fixed the easy-task over-regularization (MNIST 0.974 → 0.986), kept the CIFAR win	✅
Continual benchmark	consolidation cuts forgetting ~60–80× (0.242 → 0.003)	✅
Divya-dṛṣṭi + Pramāṇa	accepted-prediction accuracy rises 0.80 → 0.91, abstains on blind inputs	✅
Track B — embodiment	karma loop (success 1.00 vs random 0.30); battery→guṇa (ε 0.087 hungry → 0.122 charged); retention across 4 regimes 0.38 → 1.00	✅
Track C — neuromorphic (spiking)	the spiking net works — matches ANN accuracy (0.943 vs 0.929) at 10.7% spike density; conservative ~1.9× software energy floor	✅ spiking proven · ⏳ only chip deployment pending

About Track C "pending". The spiking network is done and working — it runs and matches the normal network's accuracy, which proves the architecture runs on event-driven (neuromorphic-style) computation. What is pending is only deployment to a real neuromorphic chip (Intel Loihi 2 / BrainChip Akida), which we don't have. The ~1.9× is a deliberately conservative software estimate (per-operation energy only); the famous 100–1000× neuromorphic figures come from chip-only effects (event-skipping, in-memory compute, no data movement) that a GPU simulation cannot reproduce — so we report the floor, not the headline. Un-pending it needs a neuromorphic board + a port via Intel Lava; it is the only step in the whole project gated on hardware rather than code.

Full numbers, seeds, and the honest caveats are in RESULTS.md; the staged plan is in ROADMAP.md.

5. Why it's different (and why that matters)

Most "continual learning" papers fix one mechanism. This is a single agent that unifies memory, effort, control, perception-validity and self-monitoring under one interpretable scheme — with an observable "mind-state" trace (life-stage, guṇa mix, plasticity, witness drift) you can read as it lives.
It learns forever without forgetting — and the controller learns when to protect, so it doesn't over-regularize easy tasks (a failure mode we caught and fixed honestly).
It is honest about hallucination. The pramāṇa gate is the engineering form of the Nyāya rule that extraordinary perception must be a valid means of knowledge — it abstains rather than confabulate.
It carries from supervised → embodied → spiking unchanged, because the architecture (not a trick) is the contribution.
It is a research POC, stated plainly. Strong faculties (consolidation, replay, pramāṇa, forgetting-aware control, task-conditioned policy) are proven; modest ones (tapas, decay) and conceptual ones (āśrama, the witness) are labeled as such — see the component scorecard in RESULTS.md.

6. How it advances current AI research

A research proof-of-concept — but one that speaks directly to several of the field's most active open problems, offering a principled, reproducible framework rather than a point fix.

Continual / lifelong learning. Catastrophic forgetting is one of ML's hardest open problems — today's large models are effectively frozen after training and must be expensively re-trained to absorb new knowledge. This unifies importance-based consolidation, rehearsal, and a forgetting-aware controller into a single agent that learns indefinitely, cutting forgetting ~6–80× in our runs.
Compute & energy sustainability. Frontier-model (re)training consumes gigawatt-hours. The architecture offers two complementary levers — incremental updates (no retrain-from-scratch) and an event-driven spiking path (validated in software, matching ANN accuracy) — a concrete route toward order-of-magnitude lower inference energy on neuromorphic hardware.
Reliability & hallucination. Models routinely assert what they don't know. The Pramāṇa validity gate provides calibrated abstention — "know when you don't know" — a deployable anti-hallucination primitive grounded in epistemology (extended perception must be a valid means of knowledge, not fancy).
Safety & alignment. The turīya reward-invariant monitor, plus the siddhi principle — capabilities must stay subordinate to the goal (Yoga Sūtra 3.37) — anticipate modern instrumental-goal / mesa-optimization concerns and provide a structural oversight pattern, not an afterthought.
Self-regulation & adaptivity. A single interpretable guṇa signal auto-balances exploration, consolidation, and conservation; the controller learns when to protect (fixing over-regularization on easy tasks), and — when embodied — ties learning dynamics to real resource state (battery → guṇa).
Interpretability. Unlike opaque agents, it exposes a readable "mind-state" trajectory — life-stage, guṇa mix, plasticity headroom, identity drift — making the learning process auditable.
Embodied & agentic AI. The karma loop (action → consequence → disposition) and metabolic-state-driven behavior give a principled scaffold for autonomous agents that learn and self-regulate in the world, not just on a dataset.
A bridge from cognitive science to ML. Rather than ad-hoc tricks, it contributes a coherent, theory-grounded cognitive architecture — a template for composing modular faculties into one self-regulating whole, shipped as an open, modular library (ChittaKit) that drops into any PyTorch backbone, with a transparent, falsifiable results scorecard others can build on.

In short: it reframes a set of disconnected AI problems — forgetting, energy, hallucination, alignment, adaptivity, interpretability — as facets of one missing capability: a principled architecture for a mind that learns for life and regulates itself. That reframing, with working evidence, is the contribution.

7. Run it

pip install -r requirements.txt          # torch, numpy  (+ torchvision/snntorch for Phase II/C)

cd experiments
python3 integrated_agent.py              # ★ the whole agent + its mind-state trace (CPU, ~1 min)
python3 capacity_benchmark.py            # consolidation / decay / tapas ablation
python3 divya_drsti.py                   # extended perception + Pramāṇa validity gate
python3 track_b_v2.py                    # embodiment: karma loop + battery→guṇa + retention
# GPU (e.g. CUDA_VISIBLE_DEVICES=0):
python3 phase2_vision.py --dataset cifar10   # real CNN on Split-CIFAR
python3 track_c_spiking.py                   # spiking perception net (snnTorch)
python3 make_figures.py                      # regenerate the README figures

chittakit/     the novel modules — saṃskāra · guṇa · meta-guṇa · āśrama · tapas · pramāṇa · witness · antahkarana
experiments/   integrated_agent · capacity/continual benchmarks · divya_drsti · sanjaya · track_b · track_c · phase2_vision
philosophy/    the deep study: texts & mantras → modern science → Sanskrit formulae → math models → architecture
assets/        banner · architecture diagram · results figures
RESULTS.md     every number + the honest scorecard      ROADMAP.md   what's done / what's next

8. How it can be extended (it's amazing because it can grow)

Scale the backbone — drop in a ResNet/ViT or a transformer; the Vedic layer is unchanged.
Neuromorphic hardware — map the spiking parts to Loihi 2 / Akida (via Intel Lava) for the milliwatt, always-on form — the full energy thesis.
Real robot — the embodied agent → Jetson + ROS 2, with a real battery driving the guṇa and an event camera feeding manas.
Meta-learn the controllers — guṇa and tapas via meta-gradient / population-based training.
Richer sims — MuJoCo / Isaac for physics; a continual RL stream for the karma loop at scale.

Each is extension, not invention: the hard part — assembling a coherent, lifelong, self-regulating agent from the Vedic model and proving it on real and spiking hardware — is done.

9. Honest scope

This is a deep-research proof-of-concept at modest scale (small CNNs, MNIST/CIFAR, a gridworld) — enough to prove the architecture works, not to rival a frontier model. Every negative result (the v1 RL-retention miss, the neutral evolution-strategy run, the MNIST over-regularization) was diagnosed and either fixed or kept as a labeled limitation — the discipline that makes the positive results trustworthy. The Vedic↔ML mappings are engineering analogies, clearly flagged; no claim is made that the texts contain neuroscience, and nothing here is conscious.

Code: MIT. Built on the Upaniṣads, Sāṃkhya, Yoga, and modern ML (PyTorch · snnTorch). Part of a deep study of the Vedic philosophy of mind — see philosophy/.

Downloads last month: -; Downloads are not tracked for this model. How to track

Video Preview

Reinforcement Learning