Collections

Discover the best community collections!

Collections including paper arxiv:2309.00267
RL/Alignment
Collection by 14 days ago
LLM Refs
Collection by 17 days ago
Preference Alignment in LLM
methods that align llm with human preference
LLM Datasets
Collection by Mar 5
Deep Reinforcement Learning
Features implementations and paces of popular RL algorithms and new paradigms on a variety of environments.
Super Alignment
Collection by 3 days ago
Dataset generation
Collection by 19 days ago
Human Feedback
Collection by Feb 8