Question

#2
by mrfakename - opened

Hi
Thanks for releasing Granite, can’t wait to try it out. If it’s based on the Llama arch, why does it need Transformers 4.41?
Thanks!
(PS: thanks for using the Llama arch instead of a custom one - makes it so much easier to tune :))

IBM Granite org

Hi @mrfakename, the Llama arch needed one new config parameter, 'mlp_bias', and that change had to land in Transformers first, hence the version requirement.
PR: https://github.com/huggingface/transformers/pull/30031
The rest is similar to Llama.
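
For anyone who wants to check their environment before loading the model, here is a minimal sketch of a version check. It assumes the 4.41 minimum mentioned in the question; the helper names (parse_version, supports_mlp_bias) are made up for illustration and are not part of Transformers. Pre-release suffixes such as ".dev0" are simply ignored, so treat the result as a rough guide.

```python
import re
from importlib import metadata

# Minimum (major, minor) taken from the question above; an assumption, not an official constant.
MIN_VERSION = (4, 41)


def parse_version(version: str) -> tuple[int, ...]:
    """Return the leading numeric components of a version string."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError(f"unrecognised version string: {version!r}")
    return tuple(int(part) for part in match.group(0).split("."))


def supports_mlp_bias(version: str) -> bool:
    """Hypothetical helper: True if the version is at least MIN_VERSION."""
    return parse_version(version)[:2] >= MIN_VERSION


if __name__ == "__main__":
    try:
        installed = metadata.version("transformers")
    except metadata.PackageNotFoundError:
        print("transformers is not installed")
    else:
        status = "ok" if supports_mlp_bias(installed) else "too old, please upgrade"
        print(f"transformers {installed}: {status}")
```

If the check reports an older version, upgrading with pip install -U transformers should be enough.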

thx for the explanation! makes sense

mrfakename changed discussion status to closed
