YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

🌊 Torrential Attention β€” vanilla

Results

  • Eval Loss: 5.6815
  • Eval Perplexity: 293.39
  • Parameters: 14,624,024
  • Training Time: 986s

Architecture

Torrential Attention Mechanism with five pillars:

  1. Sigmoid Flood Attention (unnormalized, pressure-driven)
  2. Position-Decay Budgets (cascading energy)
  3. Cross-Layer Flood Thresholds (dam-break phase transitions)
  4. Cross-Head Spillover (tributary merging)
  5. Differential Noise Cancellation
Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support