Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
144.5
TFLOPS
652
11
162
Arthur Zucker
ArthurZ
Follow
mrm8488's profile picture
CCP6's profile picture
qnguyen3's profile picture
234 followers
·
16 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Articles
Fine-Tuning Gemma Models in Hugging Face
Feb 23
•
17
Code Llama: Llama 2 learns to code
Aug 25, 2023
•
3
Organizations
ArthurZ
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
meta-llama/Meta-Llama-3.1-405B-FP8
3 days ago
Update tokenizer to prepend special token
#12 opened 3 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-70B
3 days ago
Update tokenizer to prepend special token
1
#11 opened 3 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-8B-Instruct
3 days ago
Upload tokenizer
2
#29 opened 3 days ago by
ArthurZ
Upload tokenizer
#28 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct-FP8
3 days ago
Upload tokenizer
1
#9 opened 3 days ago by
ArthurZ
Update `_name_or_path` to the HF model id
#8 opened 3 days ago by
davidthomas426
New activity in
meta-llama/Meta-Llama-3.1-8B
3 days ago
Update tokenizer to prepend special token
1
#12 opened 3 days ago by
lysandre
New activity in
meta-llama/Meta-Llama-3.1-405B-Instruct
3 days ago
Upload tokenizer
1
#9 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B-Instruct
3 days ago
Upload tokenizer
1
#12 opened 3 days ago by
ArthurZ
New activity in
ArthurZ/new-t5-base
3 days ago
Upload tokenizer
#1 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-8B-Instruct
3 days ago
Upload tokenizer
#27 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B-Instruct
3 days ago
Upload tokenizer
#11 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-405B-FP8
3 days ago
Fix quantization_config to work with vLLM v0.5.3.post1
1
#11 opened 3 days ago by
davidthomas426
New activity in
meta-llama/Meta-Llama-3.1-8B-Instruct
3 days ago
DO NOT MERGE v2 make sure vllm and transformers work
#12 opened 3 days ago by
ArthurZ
DO NOT MERGE test for vllm
2
#11 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B
3 days ago
Can we add `use_scaled_rope` in the config.json?
4
#2 opened 10 days ago by
lanking
New activity in
meta-llama/Llama-Guard-3-8B-INT8
3 days ago
Update config.json
#6 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Llama-Guard-3-8B
3 days ago
Update config.json
#9 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B
3 days ago
Update config.json
#9 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B-Instruct
3 days ago
Update config.json
#6 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-70B
3 days ago
Update config.json
#8 opened 3 days ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3.1-8B
3 days ago
Update config.json
#10 opened 3 days ago by
ArthurZ
New activity in
google/gemma-2-27b-it
28 days ago
Model repeating information and "spitting out" random characters
2
#12 opened 28 days ago by
brazilianslib
Hallucinations, misspellings etc. Something seems broken?
18
#10 opened 29 days ago by
sam-paech
New activity in
google/gemma-2-27b-it
29 days ago
transformers load fails?
7
#6 opened 29 days ago by
bdambrosio
New activity in
google/gemma-2-9b
about 1 month ago
Runtime autograd error due to inplace operations
1
#4 opened about 1 month ago by
xianbin
New activity in
microsoft/Florence-2-large
about 1 month ago
Please add to llama.cpp and ollama
3
#21 opened about 1 month ago by
KeilahElla
New activity in
meta-llama/Meta-Llama-3-8B
about 2 months ago
Why are "add_bos_token" and "add_eos_token" missing in tokenizer_config.json ?
1
#140 opened 2 months ago by
ekurtic
New activity in
mistralai/Mistral-7B-Instruct-v0.3
about 2 months ago
Slow tokenizer problem.
4
#22 opened 2 months ago by
bradhutchings
New activity in
meta-llama/Meta-Llama-3-8B
2 months ago
LlamaTokenizerFast.from_pretrained gives incorrect number of tokens for Llama3
2
#156 opened 2 months ago by
farzadab
New activity in
mistralai/Mistral-7B-Instruct-v0.3
2 months ago
Add minor reference to transformers
4
#7 opened 2 months ago by
osanseviero
Upload tokenizer
#6 opened 2 months ago by
ArthurZ
Upload tokenizer
#5 opened 2 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
2 months ago
Update README.md
#4 opened 2 months ago by
ArthurZ
Update README.md
#3 opened 2 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
2 months ago
Update README.md
#4 opened 2 months ago by
ArthurZ
Update config.json
1
#3 opened 2 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
2 months ago
Upload MistralForCausalLM
#2 opened 2 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
2 months ago
Upload MistralForCausalLM
#2 opened 2 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-v0.3
2 months ago
Upload tokenizer
1
#1 opened 2 months ago by
ArthurZ
New activity in
mistralai/Mistral-7B-Instruct-v0.3
2 months ago
Upload tokenizer
#1 opened 2 months ago by
ArthurZ
New activity in
01-ai/Yi-9B
2 months ago
Tokenizer inconsistencies related to HTML tags
4
#11 opened 4 months ago by
sanderland
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
2 months ago
Update config.json
1
#105 opened 2 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
2 months ago
Update config.json
3
#49 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
3 months ago
The sample code for usage with Transformers is incorrect.
2
#45 opened 3 months ago by
endNone
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
3 months ago
How to use EOT_ID
4
#54 opened 3 months ago by
saksham-lamini
New activity in
meta-llama/Meta-Llama-3-8B
3 months ago
Setting `pad_token_id` to `eos_token_id`:128001 for open-end generation.
9
#72 opened 3 months ago by
tianke0711
Unable to load the model for Torch versions starting from 2.0.1
9
#34 opened 3 months ago by
benhachem
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
3 months ago
Update config.json
4
#33 opened 3 months ago by
ArthurZ
Update README.md
1
#31 opened 3 months ago by
kimseungho
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
3 months ago
Update tokenizer_config.json
16
#60 opened 3 months ago by
Navanit-shorthills
Update config.json
1
#71 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B
3 months ago
Update generation_config.json
#10 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-70B-Instruct
3 months ago
Update generation_config.json
#30 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-8B
3 months ago
Update generation_config.json
1
#68 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-8B-Instruct
3 months ago
Update generation_config.json
1
#62 opened 3 months ago by
ArthurZ
Update generation_config.json
#61 opened 3 months ago by
ArthurZ
New activity in
meta-llama/Meta-Llama-3-8B
3 months ago
Update generation_config.json
#67 opened 3 months ago by
ArthurZ
Generated text is garbled?
5
#53 opened 3 months ago by
gbhall
is there a chat model? or i need to use specific instruction
2
#63 opened 3 months ago by
Barianc
Load more