Check the model's maximum input value

#9
by minhdang - opened

I want to check the maximum input value of model K8_0 but I don't see instructions related to it?

Following is a sample loading statement, model = AutoModelForCausalLM.from_pretrained('TheBloke/Llama-2-7B-Chat-GGML', model_file = 'llama-2-7b-chat.ggmlv3.q4_K_S.bin', context_length=1024). I think that the maximum input context window is 1024.

Sign up or log in to comment