Is this a clone of the llama-3-8b model?

#1
by malhajar - opened

No, this is a fake Llama 3.

Unsloth AI org

:D

Yes, it is a clone, but it is specifically set up so Unsloth users can train it 2x faster with 60% less memory, etc.
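For reference, a minimal sketch of loading this repo with Unsloth for faster, lower-memory fine-tuning, assuming the standard FastLanguageModel API; the sequence length and 4-bit setting are illustrative choices, not values from this thread:

```python
from unsloth import FastLanguageModel

# Load the ungated Unsloth mirror of Llama 3 8B (base model).
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/llama-3-8b",  # this repo, no gated access
    max_seq_length=2048,              # illustrative context length
    load_in_4bit=True,                # 4-bit quantization to reduce memory use
)
```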

Unsloth AI org

No, this is a fake Llama 3.

Sorry, what is this supposed to mean? :)

Unsloth AI org

Yes, it's a clone :) But there's no gated access, and it works seamlessly for Unsloth users.

I was just raising a false alarm in case the author didn't respond.

Performance seems worse than llama2-7b

This is the base model

Unsloth AI org

This is the base model

Correct, it is the base model!

Unsloth AI org

Performance seems worse than llama2-7b

Really? Do you happen to see in which areas?

The performance of the 8B Instruct models (Unsloth) is much better than their Llama 2 counterparts; however, logical reasoning is still not great. For example, asking which is heavier, 1 kg of feathers or 2 kg of feathers, yields the wrong answer.

The hashes of the files at https://hf-mirror.com/unsloth/llama-3-8b/ and https://huggingface.co/meta-llama/Meta-Llama-3-8B/ are different,
so are you sure that it's a clone?
@shimmyshimmer
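One way to check this locally is to download the same file from both repos and hash it; a minimal sketch is below, where the shard filename is only an example and the gated Meta repo still requires accepted access:

```python
import hashlib
from huggingface_hub import hf_hub_download

def sha256_of(repo_id: str, filename: str) -> str:
    """Download one file from a repo and return its SHA-256 hex digest."""
    path = hf_hub_download(repo_id=repo_id, filename=filename)
    h = hashlib.sha256()
    with open(path, "rb") as f:
        # Hash in 1 MiB chunks so large shards don't need to fit in memory.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()

filename = "model-00001-of-00004.safetensors"  # example shard name
print(sha256_of("unsloth/llama-3-8b", filename))
print(sha256_of("meta-llama/Meta-Llama-3-8B", filename))
```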

@upupbug Oh sorry, actually there is a difference: our base model has the <eot> and <start_header> tokens trained, since they were left untrained in the Llama 3 base model. We only edited the lm_head and embed_tokens rows for these 2 tokens.
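As a rough illustration of editing only those rows (this is a sketch, not necessarily Unsloth's exact procedure), one could initialise the untrained rows of embed_tokens and lm_head to the mean of the trained rows; the token strings are assumed from the Llama 3 tokenizer:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "unsloth/llama-3-8b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

# Assumed special-token strings for the <eot> and <start_header> tokens.
special_tokens = ["<|eot_id|>", "<|start_header_id|>"]
ids = [tokenizer.convert_tokens_to_ids(t) for t in special_tokens]

embed = model.get_input_embeddings().weight    # embed_tokens
lm_head = model.get_output_embeddings().weight # lm_head

with torch.no_grad():
    # Replace only the rows for the untrained tokens, using the mean of all
    # other (trained) rows as a reasonable initialisation.
    mask = torch.ones(embed.shape[0], dtype=torch.bool)
    mask[ids] = False
    embed[ids] = embed[mask].mean(dim=0)
    lm_head[ids] = lm_head[mask].mean(dim=0)
```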
