Self-Detoxifying Language Models via Toxification Reversal Paper • 2310.09573 • Published Oct 14, 2023
E2CL: Exploration-based Error Correction Learning for Embodied Agents Paper • 2409.03256 • Published Sep 5
Subtle Errors Matter: Preference Learning via Error-injected Self-editing Paper • 2410.06638 • Published Oct 9
Direct Preference Optimization Using Sparse Feature-Level Constraints Paper • 2411.07618 • Published Nov 12 • 15
Direct Preference Optimization Using Sparse Feature-Level Constraints Paper • 2411.07618 • Published Nov 12 • 15
sorry-bench/ft-mistral-7b-instruct-v0.2-sorry-bench-202406 Text Generation • Updated Jul 2 • 1.37k • 4
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 123