Arthur Zucker

ArthurZ

AI & ML interests

None yet

Recent Activity

liked a model about 3 hours ago

Qwen/Qwen1.5-MoE-A2.7B

liked a model 6 days ago

google/paligemma2-10b-mix-448

liked a model 8 days ago

google/paligemma2-3b-mix-448

ArthurZ's activity

New activity in mistral-community/pixtral-12b about 2 months ago

Fastest way for inference?

#28 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-R1 about 2 months ago

model-00078-of-000163.safetensors not marked safe?

#80 opened about 2 months ago by

New activity in kyutai/helium-1-preview-2b 2 months ago

Update tokenizer_config.json

#1 opened 2 months ago by

New activity in mistralai/Pixtral-Large-Instruct-2411 4 months ago

Upload transformers version

#3 opened 4 months ago by

New activity in huggingface/documentation-images 4 months ago

Upload Meta-Llama-3-8B-Instruct, seqlen = 512, python, w_ compile.png

#392 opened 4 months ago by

New activity in mistral-community/pixtral-12b 5 months ago

Update model weight

#13 opened 5 months ago by

Update hidden_act to silu

#14 opened 5 months ago by

New activity in rhymes-ai/Aria 6 months ago

llama.cpp support

#1 opened 6 months ago by

New activity in google/gemma-2-2b-jpn-it 6 months ago

tokenizer_config.json is different from gemma-2-2b-it

#8 opened 6 months ago by

New activity in mistral-community/pixtral-12b 6 months ago

How can i use the full 24GB model instead of this separated safetensors files?

#8 opened 6 months ago by

New activity in meta-llama/Llama-3.2-11B-Vision-Instruct 6 months ago

hidden_activation vs hidden_act in config.json

#10 opened 6 months ago by

New activity in mistral-community/pixtral-12b-240910 6 months ago

How to use safetensors?

#13 opened 6 months ago by

New activity in mistral-community/pixtral-12b 6 months ago

lamma cpp ht to gguf not working

#2 opened 6 months ago by

New activity in meta-llama/Llama-3.1-405B-Instruct-FP8 7 months ago

8-kv-heads

#14 opened 8 months ago by

New activity in meta-llama/Llama-3.1-405B-FP8 7 months ago

Update config.json

#17 opened 7 months ago by

Config KV Heads should be 8 now?

#16 opened 8 months ago by

New activity in meta-llama/Llama-3.1-405B-Instruct-FP8 8 months ago

8 kv heads

#13 opened 8 months ago by

New activity in meta-llama/Llama-3.1-405B-FP8 8 months ago

8-kv-heads

#15 opened 8 months ago by

New activity in meta-llama/Llama-3.1-405B 8 months ago

8-kv-heads

#21 opened 8 months ago by

New activity in meta-llama/Llama-3.1-405B-Instruct 8 months ago

8-kv-heads

#17 opened 8 months ago by