Valentin Perminov
Valentin71
·
AI & ML interests
None yet
Recent Activity
new activity
about 1 month ago
Qwen/Qwen2.5-1.5B-Instruct:Special tokens
new activity
about 1 month ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B:bos_token_id mismatch between model config and tokenizer
Organizations
None yet
Valentin71's activity
Special tokens
#7 opened about 1 month ago
by
Valentin71
bos_token_id mismatch between model config and tokenizer
2
#25 opened about 2 months ago
by
guangy10

Error:Sliding Window Attention is enabled but not implemented for `sdpa`; unexpected results may be encountered
3
#27 opened about 1 month ago
by
fffutr30