Mastering Tensor Dimensions in Transformers By not-lain • about 1 month ago • 43
MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • 28 days ago • 40
Low Latency CPU Based Educational Value Classifier With Generic Educational Value By kenhktsui • Jun 12, 2024 • 9
From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate Jun 13, 2024 • 45