Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence • Paper • 2406.10957 • Published Jun 16
Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring • Paper • 2406.19949 • Published Jun 28
AERA • Collection • Resources for EMNLP 2023 Paper: Distilling ChatGPT for Explainable Automated Student Answer Assessment • 3 items • Updated Oct 14
MCTS with Preference Optimisation • Collection • Resources for EMNLP 2024 Paper: Calibrating LLMs with Preference Optimization on Thought Trees for Generating Rationale in Science Question Scoring • 8 items • Updated Oct 14
SamPO • Collection • Resources for EMNLP 2024 Paper: Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence • 4 items • Updated Oct 14