Agentic-ly agentic - a bfuzzy1 Collection

bfuzzy1 's Collections

RL

acheron

Gunny

Agents

Agentic-ly agentic

Don't hate - evaluate

Generation Nation

Nifty

Agentic-ly agentic

updated Dec 20, 2024

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15, 2024 • 39
On the limits of agency in agent-based models

Paper • 2409.10568 • Published Sep 14, 2024 • 13
On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 14
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?

Paper • 2409.07703 • Published Sep 12, 2024 • 67
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published Sep 12, 2024 • 45
Agent Workflow Memory

Paper • 2409.07429 • Published Sep 11, 2024 • 29
Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance

Paper • 2409.04593 • Published Sep 6, 2024 • 26
Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 137
Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 62
MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models

Paper • 2410.11710 • Published Oct 15, 2024 • 20