Collections

Discover the best community collections!

Collections including paper arxiv:2309.00267
Dataset generation
Collection by about 19 hours ago
LLM Refs
Collection by Apr 30
Preference Alignment in LLM
methods that align llm with human preference
LLM Datasets
Collection by Mar 5
Deep Reinforcement Learning
Features implementations and paces of popular RL algorithms and new paradigms on a variety of environments.
Super Alignment
Collection by 18 days ago
RL/Alignment
Collection by about 10 hours ago
Human Feedback
Collection by Feb 8