Text Generation
Transformers
PyTorch
English
llama
sft
Inference Endpoints
text-generation-inference

Is this compatible with GGML?

#4
by Laurab - opened

I'm trying to convert this model to GGML but I'm getting

$ python convert.py models/llama-2-orca
...
Exception: Vocab size mismatch (model has 32016, but models/tokenizer.model combined with models/added_tokens.json has 32006).

Sign up or log in to comment