RegMix: Data Mixture as Regression for Language Model Pre-training (Paper • 2407.01492 • Published Jul 1)
🥼 DrMistral (Collection • 9 items • Updated Aug 16): Mistral and Llama models trained on a corpus of French and English data to act as a medical chatbot and ace exams.