arxiv:2412.04317
Bo Tong
Tongbo
AI & ML interests
multimodal learning and efficient training
Recent Activity
authored a paper 2 days ago
FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression
commented a paper 3 days ago
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
New activity about 1 month ago
whyu/MM-Vet_Evaluator: Inquiry About Model API for Answer Post-Processing
Organizations
None yet