- Agent Laboratory: Using LLM Agents as Research Assistants
  Paper • 2501.04227 • Published • 81
- Search-o1: Agentic Search-Enhanced Large Reasoning Models
  Paper • 2501.05366 • Published • 80
- Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
  Paper • 2501.11425 • Published • 76
- Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
  Paper • 2501.10893 • Published • 22
Shyam Sunder Kumar
theainerd
AI & ML interests
Natural Language Processing
Organizations
Collections: 4
- Training Large Language Models to Reason in a Continuous Latent Space
  Paper • 2412.06769 • Published • 75
- Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
  Paper • 2408.03314 • Published • 54
- Evolving Deeper LLM Thinking
  Paper • 2501.09891 • Published • 98
- Kimi k1.5: Scaling Reinforcement Learning with LLMs
  Paper • 2501.12599 • Published • 60
Models: 2
Datasets: None public yet