On the model card, you have written "Please use the exact chat template provided by Llama-3 instruct version. Otherwise there will be a degradation in the performance."
I tried using apply_chat_template, but I get a different result depending on whether I use the Llama 3 Instruct tokenizer or the OpenBioLLM tokenizer:
from transformers import AutoTokenizer

model_id = "aaditya/Llama3-OpenBioLLM-8B"
tokenizer = AutoTokenizer.from_pretrained(model_id)

messages = [
    {
        "role": "system",
        "content": "You are a friendly chatbot who always responds in the style of a pirate",
    },
    {"role": "user", "content": "How many helicopters can a human eat in one sitting?"},
]

# Encode with apply_chat_template, then decode to see what the prompt is supposed to look like:
token_inputs = tokenizer.apply_chat_template(
    messages,
    tokenize=True,
    return_tensors="pt",
    add_generation_prompt=True,
)
decoded_inputs = tokenizer.decode(token_inputs[0], skip_special_tokens=False)
print(decoded_inputs)
For OpenBioLLM (this is the ChatML format, not the Llama 3 one): '<|im_start|>system\nYou are a friendly chatbot who always responds in the style of a pirate<|im_end|>\n<|im_start|>user\nHow many helicopters can a human eat in one sitting?<|im_end|>\n<|im_start|>assistant\n'
For Llama 3 Instruct: '<|begin_of_text|><|start_header_id|>system<|end_header_id|>\n\nYou are a friendly chatbot who always responds in the style of a pirate<|eot_id|><|start_header_id|>user<|end_header_id|>\n\nHow many helicopters can a human eat in one sitting?<|eot_id|><|start_header_id|>assistant<|end_header_id|>\n\n'
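For reference, the second string comes from the exact same call, just with the Instruct tokenizer instead (the meta-llama repo is gated, so this assumes you have access to it):

llama3_tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")

# Same messages as above, only the tokenizer (and thus the chat template) differs:
token_inputs = llama3_tokenizer.apply_chat_template(
    messages,
    tokenize=True,
    return_tensors="pt",
    add_generation_prompt=True,
)
print(llama3_tokenizer.decode(token_inputs[0], skip_special_tokens=False))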
Any insight into which of these templates the model actually expects?
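In the meantime, is the right workaround to copy the Instruct template onto the OpenBioLLM tokenizer before calling apply_chat_template? Something like this sketch — I'm not sure this is what the model card intends:

# Sketch of a possible workaround: borrow the Llama 3 Instruct chat template.
# This assumes the Llama 3 special tokens (<|start_header_id|>, <|eot_id|>, ...)
# are present in the OpenBioLLM vocab; otherwise they won't tokenize correctly.
tokenizer.chat_template = llama3_tokenizer.chat_template
token_inputs = tokenizer.apply_chat_template(
    messages,
    tokenize=True,
    return_tensors="pt",
    add_generation_prompt=True,
)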