Context Size

by Lumpen1 - opened Jun 18, 2023

Discussion

Lumpen1

Jun 18, 2023

•

edited Jun 18, 2023

Does this model have a 8k context like the original? Thank you very much for all the models btw.

TheBloke

Owner Jun 18, 2023

Initial reports are suggesting no, unfortunately not. I've had one negative report from a KoboldCpp user for example.

That might be a limitation of the KoboldCpp software itself, or it might be a restriction of the starcoder GGML implementation. I'm not sure yet.

alihkhawaher

Jun 19, 2023

•

edited Jun 19, 2023

Hello, yes, it works with 8K ctx in KoboldCpp. You just need to write the value manually in the interface, you can actually ignore the slider and edit the number directly. I just tested this.

alihkhawaher

Jun 19, 2023

Initial reports are suggesting no, unfortunately not. I've had one negative report from a KoboldCpp user for example.

That might be a limitation of the KoboldCpp software itself, or it might be a restriction of the starcoder GGML implementation. I'm not sure yet.

Actually, yes, it works with 8K ctx in KoboldCpp. You just need to write the value manually in the interface, you can actually ignore the slider and edit the number directly. I just tested this.

TheBloke

Owner Jun 19, 2023

@alihkhawaher Excellent, thank you for the feedback! I will update the README to mention this

@concedo do you think a future KoboldCpp update could allow this to be set directly in the slider, without needing to hack it? That would make KoboldCpp the perfect choice for these larger context models

concedo

Jun 20, 2023

@TheBloke Actually you don't have to hack it, the number above the slider is already an editable textbox. You can type in any number you want, and it will work if the model supports it.

A caveat is for LLAMA based models where extra memory for a bigger context needs to be pre-allocated when loading the model, this can be set with the launcher parameter --contextsize The context size can then subsequently be reduced in the UI, but not increased.

The reason why the slider is capped at 2048 is more a matter of practicality - most models are only trained on 2048 context and coherence rapidly breaks down above it, resulting in the model generating nonsense, and users may not know why. That's why 2048 was used as the default upper limit.

TheBloke

Owner Jun 20, 2023

•

edited Jun 20, 2023

OK thanks for the info, @concedo . I misunderstood what had to be done. Yes that looks absolutely fine. I've updated the README like so:

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment