A quick update on the development of Kurtis-EON1 (Echo-DSRN)

by mrs83 - opened Feb 21

ethicalabs.ai org Feb 21

I am currently finalizing the pre-print paper detailing the O(1) memory footprint, the "infinite" context extrapolation, and our training curriculum on AMD hardware (MI300x/Strix Halo).

because I want to ensure absolute transparency, I am releasing a "Work in Progress" draft of the paper for early peer review within my network before the final arXiv submission.

I also want to make a crucial architectural clarification regarding the model arch. Echo-DSRN is not a derivative of Google Titans architecture.

This is a continuation of RNN experiments I began back in 2016 for pure text generation and the foundational architecture has been in the works long before the current linear-RNN renaissance. The base model pre-training framework is based on a 10+ years old codebase, re-written in pure PyTorch.

Where Google's Titans comes into play is strictly as an inspiration for the surprise-based Gating mechanism.

Their research provided a highly elegant framing for using auto-predictive error (Surprise) to gate memory updates, which we integrated into our existing dual-state recurrent blocks to prevent the deep state from being vanished.

Echo-DSRN is a native PyTorch, O(1) recurrent engine built from the ground up. The goal is to build something that can run entirely on consumer edge devices without relying on massive GPU clusters or the Transformer's KV-cache.

The base weights remain withheld while we complete the SFT/DPO alignment phases, but I will be sharing the draft paper with select researchers shortly.

Thank you to everyone!!

mrs83

ethicalabs.ai org Feb 21

I removed references to Google Titans to prevent IP misattribution.

Echo-DSRN is an independent dual-state recurrent architecture tracing back to 2016 foundational experiments, not a Google derivative or wrapper.

Titans (surprise-gating), xLSTM (parallel scan), and Hymba (RMSNorm) will be formally cited as mechanistic inspirations and related works in the upcoming arXiv pre-print.

This ensures accurate community tracking of the base model's lineage.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment