DmitryRyumin
posted an update 12 days ago
🔥🚀🌟 New Research Alert - xLSTM! 🌟🚀🔥
📄 Title: xLSTM: Extended Long Short-Term Memory

📝 Description: xLSTM is a scaled-up LSTM architecture that uses exponential gating and modified memory structures to mitigate known limitations of LSTMs. The paper reports that xLSTM blocks compare favorably with state-of-the-art Transformers and state-space models in both performance and scaling.
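
To make "exponential gating" a bit more concrete while the official code is pending, here is a rough sketch of one step of an sLSTM-style cell as I read the paper. It is not the authors' implementation. The function name `slstm_step` and its argument names are made up for illustration, the gate pre-activations are taken as given (the input and recurrent projections are omitted), and the forget gate is assumed to be in its exponential form. The running maximum `m` keeps the exponentials in a safe numeric range, and the normalizer state `n` rescales the cell state before the output gate is applied.

```python
import numpy as np

def slstm_step(c, n, m, i_pre, f_pre, z_pre, o_pre):
    """One stabilized step of an sLSTM-style cell (illustrative sketch only).

    c, n, m      : cell state, normalizer state, stabilizer state from the previous step
    i_pre, f_pre : pre-activations of the exponential input and forget gates
    z_pre, o_pre : pre-activations of the cell input and the output gate
    """
    # The gates are exp(i_pre) and exp(f_pre), so their logs are the pre-activations.
    log_i, log_f = i_pre, f_pre

    # The stabilizer tracks a running maximum in log space.
    m_new = np.maximum(log_f + m, log_i)

    # Stabilized gates: both exponents are <= 0, so neither exp can overflow.
    i_gate = np.exp(log_i - m_new)
    f_gate = np.exp(log_f + m - m_new)

    z = np.tanh(z_pre)                   # cell input
    o = 1.0 / (1.0 + np.exp(-o_pre))     # sigmoid output gate

    c_new = f_gate * c + i_gate * z      # cell state update
    n_new = f_gate * n + i_gate          # normalizer state update
    h = o * c_new / n_new                # normalized hidden state
    return c_new, n_new, m_new, h
```

Starting from `c = n = m = 0.0`, calling `slstm_step` repeatedly gives the same hidden state as the unstabilized recurrence, because `c` and `n` are scaled by the same factor and that factor cancels in `c_new / n_new`. The difference only shows up for large pre-activations, where the unstabilized exponentials would overflow.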

Eagerly awaiting the code release! 🕒

👥 Authors: Maximilian Beck et al.

📄 Paper: xLSTM: Extended Long Short-Term Memory (2405.04517)

📚 More Papers: more cutting-edge research presented at other conferences is available in DmitryRyumin/NewEraAI-Papers, curated by @DmitryRyumin

🔍 Keywords: #xLSTM #DeepLearning #Innovation #AI