What is the maximum input token length of the Falcon-40B and -7B models?

#38
by sermolin - opened

I couldn't find it in the documentation. The reference notebook hardcodes it to 1024, mentioning the need to use int8 if the input length is >1024, but what's the maximum?
Use case: document summarization and text generation. I probably would not want to use the -Instruct model for that, right?
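One way to check a model's trained context window is to read its `config.json`. A minimal sketch, assuming the config has already been downloaded and parsed into a dict; the `sample_config` below is a hypothetical stand-in, not the actual Falcon file:

```python
import json

def max_context_length(config: dict):
    # Different model families name the context-length field differently;
    # try the common keys in order.
    for key in ("max_position_embeddings", "n_positions", "seq_length"):
        if key in config:
            return config[key]
    # Rotary-embedding models may not declare a hard limit in the config.
    return None

# Hypothetical example config (stand-in for a downloaded config.json).
sample_config = {"model_type": "falcon", "max_position_embeddings": 2048}
print(max_context_length(sample_config))  # → 2048
```

Note that even when a rotary-embedding model has no hard architectural limit, quality typically degrades beyond the sequence length it was trained on.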

Anyone have an answer for this one?

I tried to increase the number of tokens in openapi.json (cloned the repo and found it simply by searching for 1024), but that didn't help. I created a feature request: https://github.com/huggingface/text-generation-inference/issues/593. Please add to it if you need any adaptations.
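For text-generation-inference, the input limit is a server launch parameter rather than something editable in openapi.json (which is generated from the server's settings). A sketch of the relevant launcher flags; the flag names below match mid-2023 TGI releases and may differ in your version, so check `text-generation-launcher --help`:

```shell
# Hypothetical launch command; values are illustrative, not Falcon's
# documented limits.
text-generation-launcher \
  --model-id tiiuae/falcon-7b \
  --max-input-length 2048 \
  --max-total-tokens 4096
```

`--max-total-tokens` bounds input plus generated tokens, so it should be set larger than `--max-input-length` by the longest completion you expect.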
