Chang S
kkokkie2360
·
AI & ML interests
None yet
Recent Activity
updated
a model
6 days ago
deepseek-ai/DeepSeek-R1
new activity
6 days ago
deepseek-ai/DeepSeek-R1:Update model_max_length in tokenizer_config.json
new activity
6 months ago
meta-llama/Llama-3.1-405B-Instruct-FP8:8-kv-heads
Organizations
kkokkie2360's activity
Update model_max_length in tokenizer_config.json
#139 opened 6 days ago
by
kkokkie2360
8-kv-heads
8
#14 opened 6 months ago
by
ArthurZ
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1674683851722-62441cb7456803e95009a08f.jpeg)
8 kv heads
2
#13 opened 7 months ago
by
kkokkie2360
8 kv heads
2
#13 opened 7 months ago
by
kkokkie2360