Resources

Answer questions beyond the content provided

#20 opened 3 days ago by

longhtgg

megatron format to HF format

#19 opened 3 days ago by

jjw0126

[AUTOMATED] Model Memory Requirements

#18 opened 14 days ago by

model-sizer-bot

generation_config.json adds a mapping with the special token '<|im_end|>' to solve the problem of non-stop generation when <|im_end|> is encountered.

#17 opened 19 days ago by

zjyhf

The tokenizer adds a special token '<|im_end|>' to solve the problem of non-stop generation when encountering <|im_end|>.

#16 opened 19 days ago by

zjyhf

How to use in llama.cpp server

#15 opened 20 days ago by

subbur

how to set context in multi-turn QA?

#14 opened 25 days ago by

J22

Update README.md

#13 opened 25 days ago by

freyacoltman

Try to run with dedicated endpoint 4x A100 320GB still get not enough hardware capacity

#11 opened 28 days ago by

trungnx26

Colab Notebook

#10 opened 29 days ago by

ChristophSchuhmann

Megatron LM training (fine-tuning) code ?

#9 opened 29 days ago by

StephennFernandes

If i make context empty, it will output chinese.

#8 opened 30 days ago by

Cometyang

Adding `safetensors` variant of this model

#7 opened about 1 month ago by

SFconvertbot

Adding `safetensors` variant of this model

#6 opened about 1 month ago by

SFconvertbot

Chat template

#5 opened about 1 month ago by

bartowski

Adding `safetensors` variant of this model

#4 opened about 1 month ago by

SFconvertbot

I got answer with the token "ologne" at the end

#3 opened about 1 month ago by

Stilgar