13 19 33

Kaiyan Zhang

iseesaw

iseesaw

AI & ML interests

None yet

Recent Activity

authored a paper about 13 hours ago

TTRL: Test-Time Reinforcement Learning

upvoted a paper about 19 hours ago

TTRL: Test-Time Reinforcement Learning

commented on a paper about 19 hours ago

TTRL: Test-Time Reinforcement Learning

View all activity

Organizations

iseesaw's activity

upvoted a paper about 19 hours ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 1 day ago • 57

upvoted a paper 6 days ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 8 days ago • 57

upvoted a collection 24 days ago

Gemma 3 Release

Collection

24 items • Updated 5 days ago • 342

upvoted a paper 30 days ago

Video-T1: Test-Time Scaling for Video Generation

Paper • 2503.18942 • Published about 1 month ago • 88

upvoted a paper about 1 month ago

Technologies on Effectiveness and Efficiency: A Survey of State Spaces Models

Paper • 2503.11224 • Published Mar 14 • 27

upvoted a collection about 2 months ago

Qwen2.5-Coder

Collection

Code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 308

upvoted a paper 2 months ago

Diverse Inference and Verification for Advanced Reasoning

Paper • 2502.09955 • Published Feb 14 • 18

upvoted 2 articles 2 months ago

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1, 2024

• 80

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.22k

upvoted a paper 2 months ago

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 151

upvoted a paper 3 months ago

MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding

Paper • 2501.18362 • Published Jan 30 • 22

upvoted a collection 4 months ago

Reasoning Datasets

Collection

Reasoning datasets that are trending 🔥 • 10 items • Updated Jan 3 • 24

upvoted a paper 4 months ago

Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization

Paper • 2412.17739 • Published Dec 23, 2024 • 42

upvoted a paper 5 months ago

Free Process Rewards without Process Labels

Paper • 2412.01981 • Published Dec 2, 2024 • 35

upvoted a paper 10 months ago

Towards Building Specialized Generalist AI with System 1 and System 2 Fusion

Paper • 2407.08642 • Published Jul 11, 2024 • 11