CogNet-40M

A 39.7M parameter non-transformer language model with O(n) cognitive routing and hierarchical memory.

Architecture

Component Detail
Architecture Non-transformer (Cognitive Routing)
Parameters 39,718,536 (~40M)
Hidden Dim 512
Blocks 6 cognitive blocks
Channels 6 routing channels x 128 dim
FF Dim 1024
Max Seq Len 256
Tokenizer Character-level (136 vocab)

Hierarchical Memory

  • Working Memory (32 slots): Active processing
  • Episodic Memory (64 slots): Short-term recall
  • Semantic Memory (128 slots): Long-term knowledge

Training

Metric Value
Steps 50,000
Batch Size 64
LR 3e-4 (cosine)
Precision FP16 AMP
GPU RTX 5060 Ti 16GB
Final Loss ~0.005
Final PPL ~1.01

Quick Start

from inference import CogNetInference
ai = CogNetInference("cognet_best.pt", "tokenizer_v3.json")
print(ai.generate("Once upon a time"))

AICL Integration

CogNet powers AICL (Architecture Compilation Language) as its native AI engine for code generation, diagnosis, and repair.

Files

File Size Description
cognet_best.pt 152MB FP32 checkpoint
cognet_fp16.pt 77MB FP16 checkpoint
tokenizer_v3.json - Char tokenizer (136 vocab)
config.json - Model config
cognet_model.py - Architecture source
inference.py - Inference script

Roadmap

  • CogNet-40M (39.7M)
  • HuggingFace integration
  • AICL native engine
  • CogNet-1B (1B params)
  • ONNX export

MIT License. Built with PyTorch on RTX 5060 Ti via QuickPod.

Downloads last month
64
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support