Arthur Zucker

#11 opened about 1 month ago by

sanderland

New activity in meta-llama/Meta-Llama-3-8B-Instruct 6 days ago

Update config.json

#105 opened 6 days ago by

New activity in meta-llama/Meta-Llama-3-70B-Instruct 6 days ago

Update config.json

#49 opened 9 days ago by

New activity in meta-llama/Meta-Llama-3-70B-Instruct 9 days ago

The sample code for usage with Transformers is incorrect.

#45 opened 14 days ago by

endNone

New activity in meta-llama/Meta-Llama-3-8B-Instruct 9 days ago

How to use EOT_ID

#54 opened 26 days ago by

saksham-lamini

New activity in meta-llama/Meta-Llama-3-8B 9 days ago

Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation.

9

#72 opened 24 days ago by

tianke0711

Unable to load the model for Torch versions starting from 2.0.1

8

#34 opened 30 days ago by

benhachem

New activity in meta-llama/Meta-Llama-3-70B-Instruct 9 days ago

Update config.json

#33 opened 24 days ago by

Update README.md

#31 opened 24 days ago by

shokim

New activity in meta-llama/Meta-Llama-3-8B-Instruct 9 days ago

Update tokenizer_config.json

15

#60 opened 25 days ago by

Navanit-shorthills

New activity in meta-llama/Meta-Llama-3-8B-Instruct 24 days ago

Update config.json

#71 opened 24 days ago by

New activity in meta-llama/Meta-Llama-3-70B 25 days ago

Update generation_config.json

#10 opened 25 days ago by

New activity in meta-llama/Meta-Llama-3-70B-Instruct 25 days ago

Update generation_config.json

#30 opened 25 days ago by

New activity in meta-llama/Meta-Llama-3-8B 25 days ago

Update generation_config.json

#68 opened 25 days ago by

New activity in meta-llama/Meta-Llama-3-8B-Instruct 25 days ago

Update generation_config.json

#62 opened 25 days ago by

Update generation_config.json

#61 opened 25 days ago by

New activity in meta-llama/Meta-Llama-3-8B 25 days ago

Update generation_config.json

#67 opened 25 days ago by

Generated text is garbled?

#53 opened 26 days ago by

gbhall

Is there a chat model, or do I need to use a specific instruction?

#63 opened 25 days ago by

Barianc

Llama-3-8B not giving the entire outcome in Google Colab

#55 opened 26 days ago by

sayanroy07

how to download llama3

#58 opened 26 days ago by

pacopascal

The model repeats the question/answer multiple times in the output

#60 opened 26 days ago by

ameljelidi

Issues with tokenizer causing bad performance of model.

#66 opened 25 days ago by

Takuonline

Hi, I tried to load with LlamaForCausalLM and LlamaTokenizer, but it shows me the error "not a string"

7

#64 opened 25 days ago by

hjewr

New activity in TRI-ML/mamba-7b-rw 25 days ago

Adding `safetensors` variant of this model

#4 opened 26 days ago by

lucataco

New activity in google/recurrentgemma-2b-it 26 days ago

Fix tokenizer

#11 opened about 1 month ago by

pcuenq

New activity in google/recurrentgemma-2b 26 days ago

Fix tokenizer

#6 opened about 1 month ago by

pcuenq

New activity in google/recurrentgemma-2b-it 26 days ago

ValueError: The device_map provided does not give any device for the following parameters: model.normalizer

9

#8 opened about 1 month ago by

LaferriereJC

New activity in meta-llama/Meta-Llama-3-8B-Instruct 27 days ago

Tokenizer mismatch all the time

#47 opened 28 days ago by

tian9

New activity in meta-llama/Meta-Llama-3-8B 29 days ago

Update tokenizer_config.json to prepend the bos token

7

#35 opened 30 days ago by

eduagarcia

Rotary position embeddings not loaded

#39 opened 30 days ago by

cwbc

New activity in meta-llama/Meta-Llama-3-8B about 1 month ago

Rename original/tokenizer.model to tokenizer.model

#6 opened about 1 month ago by

winglian

New activity in google/recurrentgemma-2b-it about 1 month ago

ValueError when use multiple GPUs for inference

#10 opened about 1 month ago by

aladinggit

New activity in google/gemma-1.1-7b-it about 1 month ago

Fix slow tokenizer

#14 opened about 1 month ago by

pcuenq

New activity in google/recurrentgemma-2b-it about 1 month ago

I can't load this model on L4 GPU

#5 opened about 1 month ago by

albusdd

New activity in google/gemma-1.1-7b-it-GGUF about 1 month ago

Add quantized GGUFs?

#2 opened about 1 month ago by

MoonRide

New activity in hf-internal-testing/tiny-random-gpt2 about 1 month ago

Adding `safetensors` variant of this model

#2 opened 3 months ago by

SFconvertbot

New activity in ai21labs/Jamba-v0.1 about 2 months ago

Fix bias logic to enable QLoRA finetuning

#5 opened about 2 months ago by

winglian

New activity in llava-hf/llava-v1.6-mistral-7b-hf about 2 months ago

wrong padding token

#2 opened 2 months ago by

aliencaocao

New activity in hpcai-tech/grok-1 about 2 months ago

Upload tokenizer

7

#4 opened about 2 months ago by

New activity in CohereForAI/c4ai-command-r-v01 about 2 months ago

Update README.md

#34 opened about 2 months ago by

New activity in google/gemma-7b-it 2 months ago

Model "gg-hf/gemma-7b-it" doesn't exist.

#76 opened 2 months ago by

OfirHaim

New activity in google/gemma-7b 3 months ago

Very high loss compared to keras

#46 opened 3 months ago by

tanimazsin130

New activity in google/gemma-7b-it 3 months ago

Bug of modeling_gemma.py in transformers 4.38.0

#45 opened 3 months ago by

zlk

Fix chat template that is not compatible with ConversationalPipeline

#42 opened 3 months ago by

hiyouga

Bug about number generation?

#30 opened 3 months ago by

myownskyW7

New activity in google/gemma-7b 3 months ago

RuntimeError: FlashAttention backward for head dim > 192 requires A100/A800 or H100/H800

#18 opened 3 months ago by

g-ronimo

New activity in google/gemma-2b 3 months ago

Tokenizer issue on Colab

16

#7 opened 3 months ago by

arslankas

New activity in google/gemma-7b-it 3 months ago

Running sample code gives me a shape error

#22 opened 3 months ago by

dzhulgakov

Running sample code has a shape error:

6

#23 opened 3 months ago by

yingliuhf

New activity in google/gemma-2b 3 months ago

Weird token in the tokenizer?

#13 opened 3 months ago by

Lambent

New activity in google/gemma-7b-it 3 months ago

How can I input the sys message for the gemma instruct model?

#25 opened 3 months ago by

Yingding

Type Error when executing dummy script

#8 opened 3 months ago by

LKriesch

Update README.md

#14 opened 3 months ago by

ariG23498

Sample code does not work

#21 opened 3 months ago by

mengyahu

error model.generate()

14

#13 opened 3 months ago by

NickyNicky

New activity in google/gemma-2b-it 3 months ago

Update README.md

#8 opened 3 months ago by

New activity in google/gemma-7b 3 months ago

Update README.md

#16 opened 3 months ago by

New activity in google/gemma-7b-it 3 months ago

Update README.md

#7 opened 3 months ago by

New activity in google/gemma-2b 3 months ago

Update README.md

#5 opened 3 months ago by