Could you please post the inference code? This is not working with LLaMA.
This is not a LLaMA model. Read MPT's original model card for instructions.
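For reference, here is a minimal sketch of the loading pattern the MPT card describes. It assumes the base `mosaicml/mpt-7b` checkpoint and the `EleutherAI/gpt-neox-20b` tokenizer named on that card. The repository id for this particular model is not stated in this thread, so swap in the right one. `generate_text` is just a hypothetical helper name for illustration. The key difference from a LLaMA loader is `trust_remote_code=True`, which lets `transformers` run the custom model code shipped with the repository.

```python
# Assumption: the base MPT-7B checkpoint; replace with this repository's id.
MODEL_ID = "mosaicml/mpt-7b"
# The MPT-7B card names the GPT-NeoX-20B tokenizer.
TOKENIZER_ID = "EleutherAI/gpt-neox-20b"


def generate_text(prompt, model_id=MODEL_ID, tokenizer_id=TOKENIZER_ID,
                  max_new_tokens=50):
    """Load an MPT checkpoint and return the prompt plus its continuation."""
    # Imported here so this file can be imported without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(tokenizer_id)
    # MPT ships its own model class, so remote code must be trusted explicitly.
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        trust_remote_code=True,
    )

    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the full weights, so it is left commented out):
# print(generate_text("Here is a recipe for vegan banana bread:\n"))
```

This runs on CPU in full precision by default, which is slow for a model of this size. The original card covers dtype and device options.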