💡 DICE - a sail Collection

sail 's Collections

🔱 Sailor2 Language Models

🧬 RegMix: Data Mixture as Regression

📈 Scaling Laws with Vocabulary

⚓️ Sailor Language Models

💡 DICE

updated Jul 28

Self-alignment with DPO Implicit Rewards