Mu Cai

mucai

https://pages.cs.wisc.edu/~mucai/

AI & ML interests

Computer Vision, Deep Learning, 3D Vision, Vision and Language

mucai's activity

upvoted a paper 3 months ago

Vinoground: Scrutinizing LMMs over Dense Temporal Reasoning with Short Videos

Paper • 2410.02763 • Published Oct 3 • 7

upvoted 2 papers 5 months ago

Matryoshka Multimodal Models

Paper • 2405.17430 • Published May 27 • 31

FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

Paper • 2407.04051 • Published Jul 4 • 35

upvoted a paper 6 months ago

LLaRA: Supercharging Robot Learning Data for Vision-Language Policy

Paper • 2406.20095 • Published Jun 28 • 17

upvoted a collection 7 months ago

Matryoshka Multimodal Models

3 items • Updated Aug 4 • 3

upvoted a paper 8 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 253