51 Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model · 6 authors 2
12 DeepSpeed-FastGen: High-throughput Text Generation for LLMs via MII and DeepSpeed-Inference · 11 authors 2
10 SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers · 6 authors 1