Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
7
3
chen.yiwan
yiwan
Follow
SteveSHEN's profile picture
1 follower
·
0 following
chenyiwan
AI & ML interests
None yet
Recent Activity
new
activity
about 2 months ago
Qwen/QwQ-32B:
复杂推理进入死循环
new
activity
about 2 months ago
Qwen/QwQ-32B:
遇到复杂问题时,开始推理时有<think>,推理结束了还没有</think>
new
activity
2 months ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B:
[RESOLVED] Model is not outputting the <think> token at the beginning.
View all activity
Organizations
None yet
yiwan
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
Qwen/QwQ-32B
about 2 months ago
复杂推理进入死循环
30
#21 opened about 2 months ago by
frankgxy
遇到复杂问题时,开始推理时有<think>,推理结束了还没有</think>
6
#29 opened about 2 months ago by
digits12
New activity in
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
2 months ago
[RESOLVED] Model is not outputting the <think> token at the beginning.
5
#37 opened 2 months ago by
bsvaz
liked
2 models
2 months ago
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
27 days ago
•
1.78M
•
•
12k
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
Text Generation
•
Updated
Feb 24
•
924k
•
•
501
New activity in
baichuan-inc/Baichuan-7B
almost 2 years ago
这是什么情况?
1
#9 opened almost 2 years ago by
s134564
注意,这不是chat版本
2
#10 opened almost 2 years ago by
chaochaoli
24G的显卡跑两下就歇菜了
5
#5 opened almost 2 years ago by
itkingtao
liked
a model
almost 2 years ago
baichuan-inc/Baichuan-7B
Text Generation
•
Updated
Jan 9, 2024
•
15.2k
•
838
Load more