Owen Oertell's picture

1 3 1

Owen Oertell

ojo2

·

https://owenoertell.com

AI & ML interests

RL

Recent Activity

liked a dataset about 2 months ago

AI-MO/NuminaMath-CoT

authored a paper 9 months ago

REBEL: Reinforcement Learning via Regressing Relative Rewards

upvoted a paper 9 months ago

REBEL: Reinforcement Learning via Regressing Relative Rewards

View all activity

Organizations

None yet

Papers 3

arxiv:2404.16767

arxiv:2404.08495

arxiv:2404.03673

models 2

ojo2/dpo_summarization

Updated Jan 15, 2024

ojo2/gptj_summarize_sft

Text Generation • Updated Dec 12, 2023 • 7

datasets

None public yet