Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection Paper • 2409.08513 • Published Sep 13 • 10
In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss Paper • 2402.10790 • Published Feb 16 • 40
Hydragen: High-Throughput LLM Inference with Shared Prefixes Paper • 2402.05099 • Published Feb 7 • 18
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks Paper • 2402.04248 • Published Feb 6 • 30
Continual Pre-Training of Large Language Models: How to (re)warm your model? Paper • 2308.04014 • Published Aug 8, 2023 • 2
Elephant Neural Networks: Born to Be a Continual Learner Paper • 2310.01365 • Published Oct 2, 2023 • 1