SparDA: Sparse Decoupled Attention for Efficient Long-Context LLM Inference • Paper • 2606.04511 • Published 13 days ago • 3
LocateAnything: Fast and High-Quality Vision-Language Grounding with Parallel Box Decoding • Paper • 2605.27365 • Published 21 days ago • 140