Kai Zuberbühler

kaizuberbuehler · k-zubi

AI & ML interests

language models, agents, image generation, music generation

Recent Activity

updated a collection 2 days ago

LM Prompt Engineering

updated a collection 2 days ago

upvoted a paper 2 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Organizations

None yet

kaizuberbuehler's activity

updated 2 collections 2 days ago

LM Prompt Engineering

28 items • Updated 2 days ago

Reasoning

37 items • Updated 2 days ago

upvoted a paper 2 days ago

Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 3 days ago • 47

updated a collection 3 days ago

Reasoning

37 items • Updated 2 days ago

upvoted a paper 3 days ago

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published 13 days ago • 34

updated 2 collections 3 days ago

Benchmarks

48 items • Updated 3 days ago • 1

Agents

67 items • Updated 3 days ago • 3

upvoted a paper 3 days ago

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

Paper • 2310.11667 • Published Oct 18, 2023 • 3

updated a collection 3 days ago

Agents

67 items • Updated 3 days ago • 3

upvoted a paper 3 days ago

SDPO: Segment-Level Direct Preference Optimization for Social Agents

Paper • 2501.01821 • Published 9 days ago • 18

updated 3 collections 3 days ago

Vision Language Models

53 items • Updated 3 days ago • 5

Reasoning

37 items • Updated 2 days ago

LM Training

64 items • Updated 3 days ago • 1

upvoted a paper 3 days ago

Virgo: A Preliminary Exploration on Reproducing o1-like MLLM

Paper • 2501.01904 • Published 9 days ago • 28

updated a collection 3 days ago

LM Training

64 items • Updated 3 days ago • 1

upvoted a paper 3 days ago

Scaling Laws for Floating Point Quantization Training

Paper • 2501.02423 • Published 7 days ago • 22

updated a collection 3 days ago

Reasoning

37 items • Updated 2 days ago

upvoted a paper 3 days ago

Test-time Computing: from System-1 Thinking to System-2 Thinking

Paper • 2501.02497 • Published 7 days ago • 33

updated a collection 3 days ago

Reasoning

37 items • Updated 2 days ago

upvoted a paper 3 days ago

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published 6 days ago • 33