arxiv:2406.06608
michael ilie PRO
skdrx
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 2 months ago
skdrx/python-dpo-dataset-complete-just-formatting
updated
a dataset
about 2 months ago
skdrx/python-dpo-dataset-varname
Organizations
Papers
1
models
6
skdrx/ds_coder_6.7_inst_rlsf_varname
Updated
•
11
skdrx/amd135m_reasoning_finetune
Updated
•
24
skdrx/rslf_dscoder1.3b-inst-varname-gguf
Updated
•
2
skdrx/rlsf_ds_1.3b_instruct_varname
Text Generation
•
Updated
•
6
skdrx/rlstarfmodel_ds_inst
Updated
skdrx/Replete-LLM-Qwen2-7b_Beta-Preview-Q4_K_S-GGUF
Updated
•
3
datasets
11
skdrx/python-dpo-dataset-complete-just-formatting
Viewer
•
Updated
•
37k
•
41
skdrx/python-dpo-dataset-varname
Viewer
•
Updated
•
2k
•
44
skdrx/python-dpo-dataset-formatted
Viewer
•
Updated
•
2k
•
43
skdrx/python-dpo-dataset-varname-formatted-combined-ONLYSYSTEMPROMPT
Viewer
•
Updated
•
1k
•
42
skdrx/python-dpo-dataset-varname-formatted-combined-NOSYSTEMPROMPT
Viewer
•
Updated
•
1k
•
41
skdrx/python-dpo-dataset-varname-formatted-ONLYSYSTEMPROMPT
Viewer
•
Updated
•
1k
•
44
skdrx/python-dpo-dataset-varname-formatted-NOSYSTEMPROMPT
Viewer
•
Updated
•
1k
•
43
skdrx/python-dpo-dataset-varname-formatted-combined
Viewer
•
Updated
•
2k
•
42
skdrx/python-dpo-dataset-varname-formatted
Viewer
•
Updated
•
2k
•
43
skdrx/rlsf_dpo
Viewer
•
Updated
•
10k
•
3
•
1