Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback Paper • 2501.10799 • Published Jan 18 • 15
Taiwan LLM: Bridging the Linguistic Divide with a Culturally Aligned Language Model Paper • 2311.17487 • Published Nov 29, 2023 • 2