Collections

Discover the best community collections!

Collections including paper arxiv:2403.07691
About ORPO
Contains some information and experiments fine-tuning LLMs using 🤗 `trl.ORPOTrainer`
RLHF
Collection by 7 days ago
Papers - Fine-tuning
Collection by Apr 25
ORPO
This is the official collection of "ORPO: Monolithic Preference Optimization without Reference Model".
Training
Collection by 3 days ago
RLHF
Collection by Mar 19
NLP paper
Collection by Apr 25
AI Papers
Collection by 12 days ago