Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
612
10
137
Arthur Zucker
ArthurZ
Follow
langziguo's profile picture
CharlyZ's profile picture
timothepearce's profile picture
203 followers
·
14 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Articles
Fine-Tuning Gemma Models in Hugging Face
Feb 23
•
8
Code Llama: Llama 2 learns to code
Aug 25, 2023
•
2
Organizations
ArthurZ
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
01-ai/Yi-9B
5 days ago
Tokenizer inconsistencies related to HTML tags
4
#11 opened about 1 month ago by
sanderland
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
6 days ago
Update config.json
1
#105 opened 6 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
6 days ago
Update config.json
3
#49 opened 9 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
9 days ago
The sample code for usage with Transformers is incorrect.
2
#45 opened 14 days ago by
endNone
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
9 days ago
How to use EOT_ID
4
#54 opened 26 days ago by
saksham-lamini
New activity in
meta-llama/Meta-Llama-3-8B
9 days ago
Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation.
9
#72 opened 24 days ago by
tianke0711
Unable to load the model for Torch versions starting from 2.0.1
8
#34 opened 30 days ago by
benhachem
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
9 days ago
Update config.json
4
#33 opened 24 days ago by
ArthurZ
Update README.md
1
#31 opened 24 days ago by
shokim
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
9 days ago
Update tokenizer_config.json
15
#60 opened 25 days ago by
Navanit-shorthills
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
24 days ago
Update config.json
1
#71 opened 24 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B
25 days ago
Update generation_config.json
#10 opened 25 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
25 days ago
Update generation_config.json
#30 opened 25 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-8B
25 days ago
Update generation_config.json
1
#68 opened 25 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
25 days ago
Update generation_config.json
1
#62 opened 25 days ago by
ArthurZ
Update generation_config.json
#61 opened 25 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-8B
25 days ago
Update generation_config.json
#67 opened 25 days ago by
ArthurZ
Generated text is garbled?
5
#53 opened 26 days ago by
gbhall
is there a chat model? or i need to use specific instruction
2
#63 opened 25 days ago by
Barianc
Llama-3-8B not giving the entire outcome in Google Colab
2
#55 opened 26 days ago by
sayanroy07
how to download llama3
1
#58 opened 26 days ago by
pacopascal
The model repeats the question/answer multiple times in the output
4
#60 opened 26 days ago by
ameljelidi
Issues with tokenizer causing bad performance of model.
2
#66 opened 25 days ago by
Takuonline
Hi, I try to load with LlamaForCausalLM, LlamaTokenizer, but it show me the error that "not a string"
7
#64 opened 25 days ago by
hjewr
New activity in
TRI-ML/mamba-7b-rw
25 days ago
Adding `safetensors` variant of this model
3
#4 opened 26 days ago by
lucataco
New activity in
google/recurrentgemma-2b-it
26 days ago
Fix tokenizer
#11 opened about 1 month ago by
pcuenq
New activity in
google/recurrentgemma-2b
26 days ago
Fix tokenizer
#6 opened about 1 month ago by
pcuenq
New activity in
google/recurrentgemma-2b-it
26 days ago
ValueError: The device_map provided does not give any device for the following parameters: model.normalizer
9
#8 opened about 1 month ago by
LaferriereJC
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
27 days ago
Tokenizer mismatch all the time
2
#47 opened 28 days ago by
tian9
New activity in
meta-llama/Meta-Llama-3-8B
29 days ago
Update tokenizer_config.json to prepend the bos token
7
#35 opened 30 days ago by
eduagarcia
Rotary position embeddings not loaded
1
#39 opened 30 days ago by
cwbc
New activity in
meta-llama/Meta-Llama-3-8B
about 1 month ago
Rename original/tokenizer.model to tokenizer.model
2
#6 opened about 1 month ago by
winglian
New activity in
google/recurrentgemma-2b-it
about 1 month ago
ValueError when use multiple GPUs for inference
2
#10 opened about 1 month ago by
aladinggit
New activity in
google/gemma-1.1-7b-it
about 1 month ago
Fix slow tokenizer
2
#14 opened about 1 month ago by
pcuenq
New activity in
google/recurrentgemma-2b-it
about 1 month ago
I can't load this model on L4 GPU
2
#5 opened about 1 month ago by
albusdd
New activity in
google/gemma-1.1-7b-it-GGUF
about 1 month ago
Add quantized GGUFs?
1
#2 opened about 1 month ago by
MoonRide
New activity in
hf-internal-testing/tiny-random-gpt2
about 1 month ago
Adding `safetensors` variant of this model
#2 opened 3 months ago by
SFconvertbot
New activity in
ai21labs/Jamba-v0.1
about 2 months ago
Fix bias logic to enable QLoRA finetuning
3
#5 opened about 2 months ago by
winglian
New activity in
llava-hf/llava-v1.6-mistral-7b-hf
about 2 months ago
wrong padding token
2
#2 opened 2 months ago by
aliencaocao
New activity in
hpcai-tech/grok-1
about 2 months ago
Upload tokenizer
7
#4 opened about 2 months ago by
ArthurZ
New activity in
CohereForAI/c4ai-command-r-v01
about 2 months ago
Update README.md
1
#34 opened about 2 months ago by
ArthurZ
New activity in
google/gemma-7b-it
2 months ago
Model "gg-hf/gemma-7b-it" doesn't exist.
4
#76 opened 2 months ago by
OfirHaim
New activity in
google/gemma-7b
3 months ago
Very high loss compared to keras
5
#46 opened 3 months ago by
tanimazsin130
New activity in
google/gemma-7b-it
3 months ago
Bug of modeling_gemma.py in transformers 4.38.0
2
#45 opened 3 months ago by
zlk
Fix chat template does not compatible with ConversationalPipeline
5
#42 opened 3 months ago by
hiyouga
Bug about number generation?
4
#30 opened 3 months ago by
myownskyW7
New activity in
google/gemma-7b
3 months ago
RuntimeError: FlashAttention backward for head dim > 192 requires A100/A800 or H100/H800
3
#18 opened 3 months ago by
g-ronimo
New activity in
google/gemma-2b
3 months ago
Tokenizer issue on Colab
16
#7 opened 3 months ago by
arslankas
New activity in
google/gemma-7b-it
3 months ago
Running sample code gives ma a shape error
1
#22 opened 3 months ago by
dzhulgakov
Running sample code has a shape error:
6
#23 opened 3 months ago by
yingliuhf
New activity in
google/gemma-2b
3 months ago
Weird token in the tokenizer?
5
#13 opened 3 months ago by
Lambent
New activity in
google/gemma-7b-it
3 months ago
How can I input the sys message for the gemma instruct model?
3
#25 opened 3 months ago by
Yingding
Type Error when executing dummy script
3
#8 opened 3 months ago by
LKriesch
Update README.md
1
#14 opened 3 months ago by
ariG23498
sample codes do not work
5
#21 opened 3 months ago by
mengyahu
error model.generate()
14
#13 opened 3 months ago by
NickyNicky
New activity in
google/gemma-2b-it
3 months ago
Update README.md
#8 opened 3 months ago by
yekai-g
New activity in
google/gemma-7b
3 months ago
Update README.md
#16 opened 3 months ago by
yekai-g
New activity in
google/gemma-7b-it
3 months ago
Update README.md
#7 opened 3 months ago by
yekai-g
New activity in
google/gemma-2b
3 months ago
update README.md
#5 opened 3 months ago by
yekai-g
Load more