wx13
wx13
ยท
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Self-rewarding correction for mathematical reasoning
liked
a dataset
11 months ago
RLHFlow/prompt-collection-v0.1
upvoted
a
collection
11 months ago
Online RLHF
Organizations
None yet
models
None public yet
datasets
None public yet