Bingxuan Wang
YellowDoge
AI & ML interests
None yet
Recent Activity
new activity
about 2 months ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B:Vocab size in config.json mismatches the actual tokenizer size
authored
a paper
about 2 months ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
Organizations
None yet
YellowDoge's activity
Vocab size in config.json mismatches the actual tokenizer size
5
#4 opened about 2 months ago
by
Fizzarolli

KeyError: '<|endoftext|>' when using the tokenizer
10
#3 opened 9 months ago
by
YellowDoge