Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models Paper • 2304.09842 • Published Apr 19, 2023 • 2
Retrieval Meets Reasoning: Even High-school Textbook Knowledge Benefits Multimodal Reasoning Paper • 2405.20834 • Published May 31, 2024 • 1
Matryoshka Query Transformer for Large Vision-Language Models Paper • 2405.19315 • Published May 29, 2024 • 1
Thinking Like an Expert:Multimodal Hypergraph-of-Thought (HoT) Reasoning to boost Foundation Modals Paper • 2308.06207 • Published Aug 11, 2023 • 1
KAM-CoT: Knowledge Augmented Multimodal Chain-of-Thoughts Reasoning Paper • 2401.12863 • Published Jan 23, 2024 • 1
OmniVLM: A Token-Compressed, Sub-Billion-Parameter Vision-Language Model for Efficient On-Device Inference Paper • 2412.11475 • Published Dec 16, 2024 • 1