Automatic data mixture method for large language model pre-training
![Sea AI Lab's profile picture](https://cdn-avatars.huggingface.co/v1/production/uploads/1643440185801-5df833bdda6d0311fd3d5403.png)
Sea AI Lab
company
Verified
AI & ML interests
None defined yet.
models
37
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1643440185801-5df833bdda6d0311fd3d5403.png)
sail/scaling-vocab-3b-32k-overtrain
Text Generation
•
Updated
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1643440185801-5df833bdda6d0311fd3d5403.png)
sail/scaling-vocab-3b-43k-overtrain
Text Generation
•
Updated
•
3
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1643440185801-5df833bdda6d0311fd3d5403.png)
sail/Zephyr-7B-DICE-Iter2
Text Generation
•
Updated
•
3
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1643440185801-5df833bdda6d0311fd3d5403.png)
sail/Zephyr-7B-DICE-Iter1
Text Generation
•
Updated
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1643440185801-5df833bdda6d0311fd3d5403.png)
sail/Llama-3-Base-8B-DICE-Iter2
Text Generation
•
Updated
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1643440185801-5df833bdda6d0311fd3d5403.png)
sail/Llama-3-Base-8B-DICE-Iter1
Text Generation
•
Updated
•
3
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1643440185801-5df833bdda6d0311fd3d5403.png)
sail/Sailor-0.5B-Chat-gguf
Updated
•
215
•
3
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1643440185801-5df833bdda6d0311fd3d5403.png)
sail/Sailor-1.8B-Chat-gguf
Updated
•
217
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1643440185801-5df833bdda6d0311fd3d5403.png)
sail/Sailor-4B-Chat-gguf
Updated
•
210
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1643440185801-5df833bdda6d0311fd3d5403.png)
sail/Sailor-7B-Chat-gguf
Updated
•
206
•
5