YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

NeuroMamba: Brain-Inspired Language Model

A novel architecture combining selective state space models, spiking neurons, neural memory banks, and predictive coding โ€” designed to generalize like the human brain while fitting in 8GB RAM.

Architecture

Component Brain Analog Technical Approach
Selective SSM Core Thalamic gating Mamba-style selective state space (linear time)
Spiking Activation Neuron firing Surrogate-gradient spiking neurons
Neural Memory Bank Hippocampus Differentiable memory with content-based read/write
Bio-MoE Cortical columns Sparse mixture-of-experts with bio-inspired routing
Predictive Coding Top-down predictions Auxiliary loss for next-state prediction

Quick Start

# Test the model
python3 test_model.py

# Train on TinyStories (small dataset for testing)
bash run_training.sh

# Train with custom config
python3 train_hf.py \
    --dataset roneneldan/TinyStories \
    --d_model 512 \
    --n_layers 8 \
    --batch_size 4 \
    --num_train_epochs 3

Model Sizes

Config Parameters Memory (fp16) Notes
Tiny (d=256, l=6) ~25M ~50MB Fast iteration
Small (d=512, l=8) ~130M ~260MB 8GB RAM fits easily
Medium (d=768, l=12) ~350M ~700MB Strong baseline

Key Features

  • Linear complexity: No quadratic attention โ€” scales to long sequences
  • Sparse computation: Spiking neurons + MoE = most parameters inactive
  • Structured memory: Neural memory bank enables systematic generalization
  • Predictive coding: Auxiliary loss improves representation learning

Citations

Based on:

  • Mamba (Gu & Dao, 2023) โ€” selective state spaces
  • Griffin (De et al., 2024) โ€” gated linear recurrences
  • SpikeGPT (Chu et al., 2023) โ€” spiking language models
  • Neural Turing Machines (Graves et al., 2014) โ€” differentiable memory
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support