Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Paper • 2403.09472 • Published Mar 14, 2024 • 1
Forward-Backward Reasoning in Large Language Models for Mathematical Verification Paper • 2308.07758 • Published Aug 15, 2023 • 4
DeepVecFont-v2: Exploiting Transformers to Synthesize Vector Fonts with Higher Quality Paper • 2303.14585 • Published Mar 25, 2023
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published Dec 4, 2024 • 46
An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models Paper • 2408.00724 • Published Aug 1, 2024 • 1
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision Paper • 2403.09472 • Published Mar 14, 2024 • 1
Self-Play Preference Optimization for Language Model Alignment Paper • 2405.00675 • Published May 1, 2024 • 27
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2, 2024 • 121