This repository contains the `gsa-1.3B-100B` model from the paper [Gated Slot Attention for Efficient Linear-Time Sequence Modeling](https://huggingface.co/papers/2409.07146).