view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate 7 days ago • 19
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention Paper • 2309.14327 • Published Sep 25, 2023 • 21