8 1

wujinzhu

kimjohn

jinzhuer

AI & ML interests

Large Language Model, Natural Language Processing, Computer Vision

Recent Activity

commented on a paper 4 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

new activity about 2 months ago

qihoo360/TinyR1-32B-Preview:What kind of model merge method do you use ?

new activity about 2 months ago

qihoo360/TinyR1-32B-Preview:Repeated Thinking Tags in Output Generation

View all activity

Organizations

None yet

kimjohn's activity

commented a paper 4 days ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 9 days ago • 106 •

New activity in qihoo360/TinyR1-32B-Preview about 2 months ago

What kind of model merge method do you use ?

#17 opened about 2 months ago by

jiuerbujie

Repeated Thinking Tags in Output Generation

#2 opened 2 months ago by

xldistance

Output repeating

#1 opened 2 months ago by

getfit

使用chatbox输出重复，并且思考标签只有第二个

#14 opened about 2 months ago by

mrguo

Dataset

#13 opened about 2 months ago by

PSM24

TypeError argument 'tokens': 'NoneType' object cannot be converted to 'PyString'

#4 opened about 2 months ago by

youyc22

upvoted a collection 2 months ago

360Zhinao2

Collection

360Zhinao2 language model, include both base and chat model • 7 items • Updated Mar 5 • 1