Should this work with llama2.c from karpathy?

#2
by JulianX4 - opened

See title. Thank you!

It is compatible with llama.cpp. I have not tried llama2.c yet, but my best guess is yes. We use the same architecture and tokenizer as Llama 2. You may need to modify some config files slightly to fit our model shape.

I'd appreciate it if you could share your test results here if you try it out.

Thanks.

I tried it before and got “segmentation fault”.

I should have some time later tonight for tinkering; if I get a better result, I’ll write an update.

I'm hitting the following error when running export.py:

RuntimeError: shape '[32, 2, 32, 2048]' is invalid for input of size 524288

Is there any update on your side? It would be great if you could share your findings so far; I think it would benefit everyone here.

Thank you.

Unfortunately that's as far as I got as well.
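
For anyone hitting the same RuntimeError from export.py, the numbers in the message can be unpacked with a little arithmetic. The sketch below is only a reading of the error text, not a confirmed diagnosis: it assumes the tensor being reshaped is 2-D with 2048 columns, that the leading 32 is the attention head count, and that the requested shape follows a `(heads, 2, dim1 // heads // 2, dim2)` pattern. The helper `headwise_view_shape` is hypothetical and exists only for illustration.

```python
from math import prod

# Numbers copied from the error message in this thread.
requested_shape = (32, 2, 32, 2048)
actual_size = 524288

# Number of elements the failing reshape asks for.
requested_size = prod(requested_shape)
print(requested_size, actual_size)

# Assumption: the tensor is 2-D and its column count is the last requested dim.
cols = requested_shape[-1]
rows = actual_size // cols

# Assumption: the first requested dim is the attention head count,
# so the per-head size is cols // n_heads.
n_heads = requested_shape[0]
head_dim = cols // n_heads
implied_kv_heads = rows // head_dim


def headwise_view_shape(heads, dim1, dim2):
    """Hypothetical helper: the head-wise shape pattern suggested by the error."""
    return (heads, 2, dim1 // heads // 2, dim2)


# Shape that would fit the actual tensor if the smaller head count were used.
fitting_shape = headwise_view_shape(implied_kv_heads, rows, cols)
print(rows, head_dim, implied_kv_heads)
print(fitting_shape, prod(fitting_shape) == actual_size)
```

Under those assumptions, the tensor has 256 rows rather than 2048, which is what a key or value projection would look like if the model used fewer key/value heads than query heads (grouped-query attention). If that is the cause, the reshape would need the key/value head count for those weights. This has not been checked against the model's config or against export.py itself.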
