Answer questions beyond the content provided
#20 opened 3 days ago
by
longhtgg
megatron format to HF format
#19 opened 3 days ago
by
jjw0126
[AUTOMATED] Model Memory Requirements
#18 opened 14 days ago
by
model-sizer-bot
generation_config.json adds a mapping with the special token '<|im_end|>' to solve the problem of non-stop generation when <|im_end|> is encountered.
3
#17 opened 19 days ago
by
zjyhf
The tokenizer adds a special token '<|im_end|>' to solve the problem of non-stop generation when encountering <|im_end|>.
1
#16 opened 19 days ago
by
zjyhf
How to use in llama.cpp server
2
#15 opened 20 days ago
by
subbur
how to set context in multi-turn QA?
6
#14 opened 25 days ago
by
J22
Update README.md
#13 opened 25 days ago
by
freyacoltman
Try to run with dedicated endpoint 4x A100 320GB still get not enough hardware capacity
5
#11 opened 28 days ago
by
trungnx26
Colab Notebook
1
#10 opened 29 days ago
by
ChristophSchuhmann
Megatron LM training (fine-tuning) code ?
3
#9 opened 29 days ago
by
StephennFernandes
If i make context empty, it will output chinese.
6
#8 opened 30 days ago
by
Cometyang
Adding `safetensors` variant of this model
#7 opened about 1 month ago
by
SFconvertbot
Adding `safetensors` variant of this model
#6 opened about 1 month ago
by
SFconvertbot
Chat template
15
#5 opened about 1 month ago
by
bartowski
Adding `safetensors` variant of this model
#4 opened about 1 month ago
by
SFconvertbot
I got answer with the token "ologne" at the end
1
#3 opened about 1 month ago
by
Stilgar