The best collection of RLXF models, including RLHF, RLAIF, etc.
lil Amu
AI & ML interests
None yet
Recent Activity
upvoted a collection about 7 hours ago: The Big Benchmarks Collection
updated a model about 7 hours ago: Amu/t1-3B
liked a model about 7 hours ago: Amu/t1-3B
Organizations
None yet
Collections 3
Models 15
Amu/t1-3B • Text Generation • Updated • 8 • 1
Amu/t1-1.5B • Text Generation • Updated • 171 • 1
Amu/supertiny-llama3-0.25B-v0.1 • Text Generation • Updated • 89 • 5
Amu/dpo-qlora-Qwen1.5-0.5B-Chat-xtuner • Text Generation • Updated • 28
Amu/orpo-phi2 • Text Generation • Updated • 12
Amu/orpo-lora-phi2 • Text Generation • Updated • 144
Amu/spin-phi2 • Text Generation • Updated • 51 • 9
Amu/r-zephyr-7b-beta-qlora • Updated
Amu/dpo-phi2 • Text Generation • Updated • 235 • 2
Amu/zen-moe • Text Generation • Updated • 12
datasets
None public yet