WizardLM-70B-V1.0 / tokenizer_config.json

Commit History

Maximum sequence length for a Llama 2 model is 4096 (#3)
153924a

WizardLM TheBloke committed on

70B V1.0
37558d7

WizardLM committed on