This collections contains the list of model being trained and evaluated in the preprint: SimPO: Simple Preference Optimization with a Reference-Free R
Princeton NLP group
princeton-nlp
AI & ML interests
None yet
Organizations
Collections
1
models
229
princeton-nlp/Mistral-7B-Instruct-RDPO
Text Generation
•
Updated
•
1
princeton-nlp/Mistral-7B-Instruct-ORPO
Text Generation
•
Updated
princeton-nlp/Mistral-7B-Instruct-KTO
Text Generation
•
Updated
princeton-nlp/Mistral-7B-Instruct-IPO
Text Generation
•
Updated
princeton-nlp/Mistral-7B-Instruct-DPO
Text Generation
•
Updated
princeton-nlp/Mistral-7B-Base-SFT-SimPO
Text Generation
•
Updated
•
8
princeton-nlp/Mistral-7B-Base-SFT-RDPO
Text Generation
•
Updated
•
1
princeton-nlp/Mistral-7B-Base-SFT-KTO
Text Generation
•
Updated
princeton-nlp/Mistral-7B-Base-SFT-IPO
Text Generation
•
Updated
princeton-nlp/Mistral-7B-Base-SFT-DPO
Text Generation
•
Updated
datasets
32
princeton-nlp/llama3-ultrafeedback
Viewer
•
Updated
•
360
•
3
princeton-nlp/QuRatedPajama-1B_tokens_for_analysis
Viewer
•
Updated
•
1
•
3
princeton-nlp/QuRatedPajama-260B
Viewer
•
Updated
•
3
•
5
princeton-nlp/SWE-bench
Viewer
•
Updated
•
34.5k
•
54
princeton-nlp/SWE-bench_bm25_50k_llama
Viewer
•
Updated
•
1
•
4
princeton-nlp/SWE-bench_Lite
Viewer
•
Updated
•
23.4k
•
7
princeton-nlp/SWE-bench_bm25_13k_cl100k
Viewer
•
Updated
•
1
princeton-nlp/SWE-bench_bm25_27k_cl100k
Viewer
•
Updated
•
2
princeton-nlp/SWE-bench_oracle_llama
Viewer
•
Updated
•
1
•
1
princeton-nlp/SWE-bench_oracle_cl100k
Viewer
•
Updated