Doge-CheckPoint Collection A series of checkPoint weights that can continue training on new datasets without spikes of the training. • 3 items • Updated 1 day ago • 1
YuLan-Mini Collection A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 5 items • Updated Dec 29, 2024 • 12
Wonderful Matrices: Combining for a More Efficient and Effective Foundation Model Architecture Paper • 2412.11834 • Published Dec 16, 2024 • 7
Cheems: Wonderful Matrices More Efficient and More Effective Architecture Paper • 2407.16958 • Published Jul 24, 2024 • 3