Are Vision-Language Models Truly Understanding Multi-vision Sensor? (arXiv:2412.20750, Dec 2024)
VLsI: Verbalized Layers-to-Interactions from Large to Small Vision Language Models (arXiv:2412.01822, Dec 2, 2024)
Phantom of Latent for Large Language and Vision Models (arXiv:2409.14713, Sep 23, 2024)
SPARK: Multi-Vision Sensor Perception and Reasoning Benchmark for Large-scale Vision-Language Models (arXiv:2408.12114, Aug 22, 2024)
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models (arXiv:2406.01920, Jun 4, 2024)
What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models (arXiv:2403.13513, Mar 20, 2024)
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models (arXiv:2405.15574, May 24, 2024)
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing (arXiv:2306.14435, Jun 26, 2023)