Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
lv12
's Collections
Representation Learning
Preference Optimization
Information Retrieval
Preference Optimization
updated
Jun 14
x
Upvote
1
A Roadmap to Pluralistic Alignment
Paper
•
2402.05070
•
Published
Feb 7
Self-Rewarding Language Models
Paper
•
2401.10020
•
Published
Jan 18
•
144
SakanaAI/DiscoPOP-zephyr-7b-gemma
Text Generation
•
Updated
Jun 13
•
5.99k
•
36
Upvote
1
Share collection
View history
Collection guide
Browse collections