Jingcheng Hu
reign12
AI & ML interests
Foundation models and alignment
Recent Activity
new activity
about 16 hours ago
Open-Reasoner-Zero/orz_math_72k_collection_extended:Add link to paper, task category
new activity
about 16 hours ago
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-32B:Add library and pipeline tags
new activity
about 16 hours ago
Open-Reasoner-Zero/Open-Reasoner-Zero-Critic-7B:Add pipeline tag and library name
Organizations
reign12's activity
Add link to paper, task category
#3 opened 1 day ago
by
nielsr

Add library and pipeline tags
#1 opened 1 day ago
by
nielsr

Add pipeline tag and library name
#1 opened 1 day ago
by
nielsr

Add pipeline tag and library name
#1 opened 1 day ago
by
nielsr

Add pipeline tag and library_name
#1 opened 1 day ago
by
nielsr

Add pipeline tag and library name
#1 opened 1 day ago
by
nielsr

Add pipeline and library tags
#1 opened 1 day ago
by
nielsr

Improve model card with metadata and links
#1 opened 1 day ago
by
nielsr

Add task category, correct arxiv link
#1 opened 1 day ago
by
nielsr

Add library and pipeline tags
#1 opened 1 day ago
by
nielsr

Add task category
#2 opened 1 day ago
by
nielsr

Is there any source of the original Chinese QA pair?
#2 opened 5 days ago
by
reign12

Make viewer work
#1 opened 7 days ago
by
davanstrien

Add paper link
#3 opened 10 months ago
by
AdinaY

33B when?
2
#8 opened over 1 year ago
by
nova434431
Question about evaluating this reward model on Anthropic/hh-rlhf
1
#4 opened almost 2 years ago
by
songff
More details on training data for reward model
#2 opened over 1 year ago
by
reign12

How is this dataset filtered?
#1 opened over 1 year ago
by
reign12

大神是怎么收集这么多高质量的数据的啊
3
#1 opened about 2 years ago
by
leonall