Block Transformer: Global-to-Local Language Modeling for Fast Inference Paper • 2406.02657 • Published 3 days ago • 25
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models Paper • 2405.15574 • Published 14 days ago • 48
Distort, Distract, Decode: Instruction-Tuned Model Can Refine its Response from Noisy Instructions Paper • 2311.00233 • Published Nov 1, 2023 • 4
Navigating Data Heterogeneity in Federated Learning: A Semi-Supervised Approach for Object Detection Paper • 2310.17097 • Published Oct 26, 2023 • 3