Max Context length?

#5
by lazyDataScientist - opened

Just wondering what the max context length for this model is at the moment.

It doesn’t have a hard-coded max context length like a transformer. It works somewhat like an LSTM: you can just keep feeding it input and it will keep going. It “remembers” the past context selectively, so it doesn’t lose too much performance.
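To illustrate why there's no hard context limit, here's a toy sketch of a selective state-space recurrence (hypothetical shapes and gating, not the actual Mamba implementation): the hidden state is a fixed-size vector, so memory doesn't grow with sequence length, and input-dependent gates decide per token what to keep or forget.

```python
import numpy as np

rng = np.random.default_rng(0)
d_state = 16    # fixed state size, independent of sequence length
d_in = 8
A = -np.exp(rng.normal(size=d_state))  # stable (negative) state decay rates

def selective_step(h, x, W_dt, W_b, W_c):
    """One recurrent step. The gates (dt, B, C) are computed FROM the
    input x -- that is the 'selective' part: the model chooses per token
    how strongly to retain or overwrite the past state."""
    dt = np.log1p(np.exp(W_dt @ x))  # softplus: input-dependent step size
    B = W_b @ x                      # input-dependent write gate
    C = W_c @ x                      # input-dependent read gate
    h = np.exp(dt * A) * h + dt * B * x.mean()  # decay old state, write new info
    y = C @ h                        # read out a scalar from the state
    return h, y

# Stream an arbitrarily long input; the state stays d_state-sized throughout.
W_dt, W_b, W_c = (rng.normal(size=(d_state, d_in)) for _ in range(3))
h = np.zeros(d_state)
for x in rng.normal(size=(1000, d_in)):  # 1000 tokens, could be any length
    h, y = selective_step(h, x, W_dt, W_b, W_c)
print(h.shape)  # state size unchanged: (16,)
```

The key contrast with a transformer: attention keeps all past tokens around (memory grows with length), while here everything the model retains must fit in that fixed-size state.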

see: https://arxiv.org/pdf/2312.00752.pdf

They discuss this in the section on synthetic tasks.
