Congliu/Chinese-DeepSeek-R1-Distill-data-110k Viewer • Updated 21 days ago • 110k • 8.03k • 528
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search Paper • 2408.08152 • Published Aug 15, 2024 • 56
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28, 2024 • 97