the best collection of RLXF model including RLHF, RLAIF etc.
lil
Amu
AI & ML interests
None yet
Organizations
None yet
Collections
2
models
14
Amu/dpo-full-Qwen1.5-0.5B-Chat-xtuner
Updated
Amu/dpo-qlora-Qwen1.5-0.5B-Chat-alignment-handbook
Text Generation
•
Updated
•
1.83k
Amu/dpo-qlora-Qwen1.5-0.5B-Chat-xtuner
Updated
Amu/orpo-phi2
Text Generation
•
Updated
•
1.89k
Amu/orpo-lora-phi2
Text Generation
•
Updated
•
1.87k
Amu/spin-phi2
Text Generation
•
Updated
•
3.76k
•
9
Amu/r-zephyr-7b-beta-qlora
Updated
Amu/dpo-phi2
Text Generation
•
Updated
•
2.67k
•
2
Amu/zen-moe
Text Generation
•
Updated
•
1.88k
Amu/zen
Text Generation
•
Updated
•
2.58k
datasets
None public yet