-
BLINK: Multimodal Large Language Models Can See but Not Perceive
Paper • 2404.12390 • Published • 22 -
GAIA: a benchmark for General AI Assistants
Paper • 2311.12983 • Published • 170 -
RULER: What's the Real Context Size of Your Long-Context Language Models?
Paper • 2404.06654 • Published • 30 -
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues
Paper • 2404.03820 • Published • 20
Kai Zuberbühler
kaizuberbuehler
·
AI & ML interests
language models, agents, image generation, music generation
Organizations
None yet
Collections
17
-
EdgeFusion: On-Device Text-to-Image Generation
Paper • 2404.11925 • Published • 19 -
Dynamic Typography: Bringing Words to Life
Paper • 2404.11614 • Published • 40 -
ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Paper • 2404.07987 • Published • 45 -
Applying Guidance in a Limited Interval Improves Sample and Distribution Quality in Diffusion Models
Paper • 2404.07724 • Published • 10
models
None public yet
datasets
None public yet