arxiv:2410.12491
Jared Joselowitz
jaredjoss
·
AI & ML interests
None yet
Recent Activity
updated
a model
11 days ago
jaredjoss/reward-models
updated
a model
11 days ago
jaredjoss/reward-models
authored
a paper
2 months ago
Insights from the Inverse: Reconstructing LLM Training Goals Through
Inverse RL
Organizations
None yet
Papers
1
spaces
2
models
23
jaredjoss/reward-models
Updated
jaredjoss/pythia-70m-irl-29eps-01-rlhf-model
Updated
•
2
jaredjoss/pythia-70m-irl-29eps-0035-rlhf-model
Updated
•
4
jaredjoss/pythia-410m-irl-6eps-15reps-rlhf-model
Updated
•
2
jaredjoss/pythia-70m-dahoas-hh-1-epoch-1000-steps-1e-7-lr-sft
Updated
•
1
jaredjoss/pythia-160m-dahoas-hh-1-epoch-1000-steps-1e-7-lr-sft
Updated
•
1
jaredjoss/pythia-410m-dahoas-hh-1-epoch-10000-steps-sft
Updated
•
2
jaredjoss/pythia-160m-dahoas-hh-1-epoch-10000-steps-sft
Updated
•
2
jaredjoss/pythia-70m-dahoas-hh-1-epoch-10000-steps-sft
Updated
•
2
jaredjoss/pythia-70m-irl-10eps-58reps-rlhf-model
Updated
•
2
datasets
6
jaredjoss/jaredjoss-jigsaw-long-2000_160M_toxic
Viewer
•
Updated
•
1k
•
36
jaredjoss/jaredjoss-jigsaw-long-2000_160M_non_toxic
Viewer
•
Updated
•
1k
•
35
jaredjoss/jaredjoss-jigsaw-long-2000_410M_toxic
Viewer
•
Updated
•
1k
•
37
jaredjoss/jaredjoss-jigsaw-long-2000_410M_non_toxic
Viewer
•
Updated
•
1k
•
35
jaredjoss/jigsaw-long-2000
Viewer
•
Updated
•
2k
•
28
jaredjoss/allenai-combined
Viewer
•
Updated
•
999
•
5