-
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 138 -
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution
Paper • 2409.12191 • Published • 76 -
Expect the Unexpected: FailSafe Long Context QA for Finance
Paper • 2502.06329 • Published • 126 -
Competitive Programming with Large Reasoning Models
Paper • 2502.06807 • Published • 67
Julian Wergieluk
jwergieluk
·
AI & ML interests
machine learning, mathematics, optimization
Recent Activity
updated
a collection
30 days ago
Papers inbox
updated
a collection
30 days ago
Papers inbox
updated
a collection
30 days ago
Papers inbox
Organizations
Collections
1
models
None public yet
datasets
None public yet