MaziyarPanahi/Llama-Nemotron-Post-Training-Dataset-v1-ShareGPT • Updated 10 days ago • 30.2M • 1.25k • 31
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published Jan 14 • 282