HanSaem Kim

kensaem

AI & ML interests

None yet

Recent Activity

upvoted a paper about 19 hours ago

InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework

upvoted a paper about 21 hours ago

Cobra: Efficient Line Art COlorization with BRoAder References

upvoted a paper about 21 hours ago

BitNet b1.58 2B4T Technical Report

View all activity

Organizations

None yet

kensaem's activity

upvoted a paper about 19 hours ago

InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework

Paper • 2504.12395 • Published 7 days ago • 16

upvoted 7 papers about 21 hours ago

FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation

Paper • 2504.07405 • Published 14 days ago • 12

PixelFlow: Pixel-Space Generative Models with Flow

Paper • 2504.07963 • Published 13 days ago • 19

Compass Control: Multi Object Orientation Control for Text-to-Image Generation

Paper • 2504.06752 • Published 15 days ago • 10

VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning

Paper • 2504.07960 • Published 13 days ago • 46

upvoted 3 papers 9 days ago

VLM-R1: A Stable and Generalizable R1-style Large Vision-Language Model

Paper • 2504.07615 • Published 14 days ago • 30

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 9 days ago • 239

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 12 days ago • 121

upvoted 6 papers 10 days ago

TAPNext: Tracking Any Point (TAP) as Next Token Prediction

Paper • 2504.05579 • Published 16 days ago • 5

Kimi-VL Technical Report

Paper • 2504.07491 • Published 14 days ago • 120

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Paper • 2504.05541 • Published 16 days ago • 16

OmniCaptioner: One Captioner to Rule Them All

Paper • 2504.07089 • Published 14 days ago • 20

HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance

Paper • 2504.06232 • Published 15 days ago • 12

Less-to-More Generalization: Unlocking More Controllability by In-Context Generation

Paper • 2504.02160 • Published 21 days ago • 35

upvoted a collection 10 days ago

SmolVLM2 📺 Smallest video LM ever 🤏🏻

Collection

11 items • Updated Feb 25 • 82

upvoted 2 papers 10 days ago

An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published 15 days ago • 61

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published 15 days ago • 149