Hugging Face
Owen Oertell
ojo2
2 followers · 1 following
https://owenoertell.com
AI & ML interests
RL
Recent Activity
Liked a dataset about 2 months ago: AI-MO/NuminaMath-CoT
Authored a paper 9 months ago: REBEL: Reinforcement Learning via Regressing Relative Rewards
Upvoted a paper 9 months ago: REBEL: Reinforcement Learning via Regressing Relative Rewards
Organizations
None yet
Papers
3
arxiv:2404.16767
arxiv:2404.08495
arxiv:2404.03673
Models
2
ojo2/dpo_summarization • Updated Jan 15, 2024
ojo2/gptj_summarize_sft • Text Generation • Updated Dec 12, 2023 • 7
Datasets
None public yet