Yongming Rao

raoyongming

AI & ML interests

None yet

Recent Activity

authored a paper 6 days ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

upvoted a collection 6 days ago

Insight-V

upvoted a paper 6 days ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

View all activity

Organizations

None yet

raoyongming's activity

authored a paper 6 days ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published 6 days ago • 19

upvoted a collection 6 days ago

Insight-V

Collection

Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models • 5 items • Updated 6 days ago • 7

upvoted a paper 6 days ago

Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Paper • 2411.14432 • Published 6 days ago • 19

liked a Space 2 months ago

Running on Zero

100

💬

Oryx

upvoted a paper 2 months ago

MaskBit: Embedding-free Image Generation via Bit Tokens

Paper • 2409.16211 • Published Sep 24 • 16

authored a paper 2 months ago

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Paper • 2409.12961 • Published Sep 19 • 24

upvoted a paper 2 months ago

Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

Paper • 2409.12961 • Published Sep 19 • 24

authored a paper 4 months ago

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

Paper • 2408.00754 • Published Aug 1 • 21

upvoted 2 papers 4 months ago

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

Paper • 2408.00754 • Published Aug 1 • 21

Efficient Inference of Vision Instruction-Following Models with Elastic Cache

Paper • 2407.18121 • Published Jul 25 • 16

authored a paper 4 months ago

Efficient Inference of Vision Instruction-Following Models with Elastic Cache

Paper • 2407.18121 • Published Jul 25 • 16

authored a paper 11 months ago

Generative Multimodal Models are In-Context Learners

Paper • 2312.13286 • Published Dec 20, 2023 • 34

authored a paper 12 months ago

Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior

Paper • 2312.06655 • Published Dec 11, 2023 • 23

liked a Space almost 2 years ago

Runtime error

👁

Yongming Rao

AI & ML interests

Recent Activity

Organizations

raoyongming's activity

Oryx

Unipc Sdm