DICE - a sail Collection

sail 's Collections

🧬 RegMix: Data Mixture as Regression

📈 Scaling Laws with Vocabulary

DICE

⚓️ Sailor Language Models

DICE

updated 4 days ago

Self-alignment with DPO Implicit Rewards