- CompCap: Improving Multimodal Large Language Models with Composite Captions
  Paper • 2412.05243 • Published • 19
- LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment
  Paper • 2412.04814 • Published • 47
- MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale
  Paper • 2412.05237 • Published • 47
- Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models
  Paper • 2412.05939 • Published • 16
Exclibur
AI & ML interests
None yet
Recent Activity
liked a model 1 day ago: Qwen/QwQ-32B
liked a model 1 day ago: deepseek-ai/DeepSeek-R1
updated a collection about 1 month ago: Interest
Organizations
None yet
Collections
1
Models
None public yet