What's up with these versions?

#7
by supercharge19 - opened

Are they continuation of previous model finetuned on same data, but for longer (more epochs)?

Yes, the Llama-3-8B-Instruct series builds on the previous DPO fine-tuned models in order to improve them. The Leaderboard failed to compute GSM8K for some of them, so I am not sure exactly where they stand in terms of scores.

I don't trust the leaderboard anymore. And thanks for the quick response!

Me too. By now I've learned to run a series of vibe tests: a long-text input test, long-text output generation, and a couple of other questions that were problematic before. Some 7B and even some 72B models with very high scores just don't work properly, so I developed my own vibe tests before going any further with any model.

Great work!

Hi @MaziyarPanahi Thanks for your great work! Is this also 32k context? And is the GGUF version already fixed for the tokenizer bug?

Hi, you are very welcome. The model is natively 8K; however, you can easily change the RoPE theta and extend it to 16k or 32k with minimal loss in accuracy.
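For anyone curious how the RoPE theta change works, here is a minimal sketch using the NTK-aware scaling heuristic. The formula, the 8K native context, and the 500000 base theta (the Llama-3 default) are my assumptions for illustration, not the author's exact recipe:

```python
# Sketch: extending a Llama-3 style model's context window by raising
# the RoPE base frequency (rope_theta). Uses the NTK-aware heuristic
# theta' = theta * s ** (d / (d - 2)), where s is the context scale
# factor and d is the per-head dimension. Values below are assumptions.

BASE_THETA = 500_000.0  # Llama-3 default rope_theta (assumed)
NATIVE_CTX = 8_192      # native context length (assumed)
HEAD_DIM = 128          # per-head dimension for Llama-3-8B (assumed)

def scaled_theta(target_ctx: int,
                 native_ctx: int = NATIVE_CTX,
                 theta: float = BASE_THETA,
                 head_dim: int = HEAD_DIM) -> float:
    """Return the RoPE theta needed to stretch the context to target_ctx."""
    scale = target_ctx / native_ctx
    return theta * scale ** (head_dim / (head_dim - 2))

# Print candidate theta values for 16k and 32k contexts.
for ctx in (16_384, 32_768):
    print(f"{ctx}: rope_theta ≈ {scaled_theta(ctx):,.0f}")
```

The resulting value would then go into the model config's `rope_theta` field (or llama.cpp's `--rope-freq-base` flag); treat it as a starting point to validate with your own long-context tests.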

The GGUF models were made with the latest Llama.cpp; however, if you notice anything, please let me know. The model is small, so I can fix it quickly.
