Pipeline explanation

#2
by Mikee283 - opened

Hi, first of all, thank you for your work on the llamafied model. It saved me a lot of time and effort, and it works perfectly out of the box.

I would like to replicate your results, so if you could spare some time to explain the process, that would be awesome.

Btw, do you also plan to do the same for the 128k model?

Once again thank you for the time you put into this.

Owner

Here's the conversion code: https://huggingface.co/vonjack/phi-3-mini-4k-instruct-llamafied/blob/main/convert.py
I don't think we can convert the 128k model, because it uses LongRoPE, which does not exist in the original Llama.
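
For readers who want a rough idea of what such a conversion involves without opening the script: the sketch below is not a copy of convert.py. It only illustrates the general idea, assuming the usual layout in which Phi-3 stores fused `qkv_proj` and `gate_up_proj` weights (rows ordered query, key, value and gate, up respectively) while Llama expects separate `q_proj`, `k_proj`, `v_proj`, `gate_proj` and `up_proj`. The function name `split_fused_weights` and the toy sizes are hypothetical.

```python
def split_fused_weights(state_dict, num_heads, num_kv_heads, head_dim, intermediate_size):
    """Split Phi-3 style fused projections into Llama-style separate ones.

    Assumes fused rows are ordered [q | k | v] and [gate | up]. Values only
    need to support slicing along their first axis, so nested lists are used
    in the demo below; tensors slice the same way.
    """
    q_rows = num_heads * head_dim
    kv_rows = num_kv_heads * head_dim
    out = {}
    for name, w in state_dict.items():
        if name.endswith("self_attn.qkv_proj.weight"):
            prefix = name[: -len("qkv_proj.weight")]
            out[prefix + "q_proj.weight"] = w[:q_rows]
            out[prefix + "k_proj.weight"] = w[q_rows:q_rows + kv_rows]
            out[prefix + "v_proj.weight"] = w[q_rows + kv_rows:q_rows + 2 * kv_rows]
        elif name.endswith("mlp.gate_up_proj.weight"):
            prefix = name[: -len("gate_up_proj.weight")]
            out[prefix + "gate_proj.weight"] = w[:intermediate_size]
            out[prefix + "up_proj.weight"] = w[intermediate_size:2 * intermediate_size]
        else:
            # Norms, embeddings, o_proj, down_proj, etc. are passed through unchanged.
            out[name] = w
    return out


# Toy checkpoint (hypothetical sizes): 2 heads, head_dim 1, intermediate size 3.
toy = {
    "model.layers.0.self_attn.qkv_proj.weight": [[i] for i in range(6)],
    "model.layers.0.mlp.gate_up_proj.weight": [[10 + i] for i in range(6)],
    "model.norm.weight": [1.0],
}
converted = split_fused_weights(toy, num_heads=2, num_kv_heads=2, head_dim=1, intermediate_size=3)
print(sorted(converted))
```

A real conversion would also need to rewrite the config (architecture name and related fields) so the checkpoint loads as a Llama model. The rotary embedding is the part that cannot be mapped this way for the 128k variant, since its LongRoPE scaling has no counterpart in the original Llama.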
