7 144 9

Robin Williams PRO

bfuzzy1

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Agentless: Demystifying LLM-based Software Engineering Agents

updated a collection 4 days ago

Agents

upvoted a paper 4 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

View all activity

Organizations

None yet

bfuzzy1's activity

upvoted a paper 3 days ago

Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1, 2024 • 57

updated a collection 4 days ago

Agents

Collection

Collection of resources related to Agents. • 70 items • Updated 4 days ago • 5

upvoted 2 papers 4 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 8 days ago • 75

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 7 days ago • 78

updated a collection 4 days ago

acheron-m

Collection

2 items • Updated 4 days ago

upvoted a paper 4 days ago

Entropy-Guided Attention for Private LLMs

Paper • 2501.03489 • Published 8 days ago • 12

updated 2 models 5 days ago

bfuzzy1/acheron-m1a-llama

Text Generation • Updated 5 days ago • 4

bfuzzy1/acheron-m

Text Generation • Updated 5 days ago • 153

upvoted a collection 5 days ago

LLM Reasoning Papers

Collection

Papers to improve reasoning capabilities of LLMs • 20 items • Updated about 2 hours ago • 100

upvoted 4 papers 7 days ago

HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation

Paper • 2412.21199 • Published 16 days ago • 11

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published 16 days ago • 20

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published 16 days ago • 35

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published 19 days ago • 78

liked a dataset 7 days ago

KAKA22/CodeRM-UnitTest

Viewer • Updated 12 days ago • 77.2k • 65 • 3

liked a model 7 days ago

KAKA22/CodeRM-8B

Text Generation • Updated 12 days ago • 39 • 2

upvoted 5 papers 7 days ago

Dynamic Scaling of Unit Tests for Code Reward Modeling

Paper • 2501.01054 • Published 13 days ago • 16

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published 13 days ago • 46

SDPO: Segment-Level Direct Preference Optimization for Social Agents

Paper • 2501.01821 • Published 12 days ago • 18

Graph Generative Pre-trained Transformer

Paper • 2501.01073 • Published 13 days ago • 17

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

Paper • 2501.03124 • Published 9 days ago • 13