llm-agents (LLM-Agents)

zubingou

authored a paper 3 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 381

Lin1557

authored 3 papers 3 months ago

PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL

Paper • 2409.14082 • Published Sep 21, 2024

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published Jan 8 • 54

Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 63

Lin1557

authored a paper 4 months ago

CriticBench: Benchmarking LLMs for Critique-Correct Reasoning

Paper • 2402.14809 • Published Feb 22, 2024 • 3

zubingou

authored a paper 8 months ago

DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

Paper • 2408.08152 • Published Aug 15, 2024 • 58

zubingou

authored a paper 10 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 64

zubingou

updated a dataset about 1 year ago

llm-agents/CriticBench

Viewer • Updated Feb 23, 2024 • 3.83k • 167 • 11

syhia

authored 6 papers about 1 year ago

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

Paper • 2305.11738 • Published May 19, 2023 • 8

Synthetic Prompting: Generating Chain-of-Thought Demonstrations for Large Language Models

Paper • 2302.00618 • Published Feb 1, 2023

Enhancing Retrieval-Augmented Large Language Models with Iterative Retrieval-Generation Synergy

Paper • 2305.15294 • Published May 24, 2023 • 1

zubingou

authored a paper over 1 year ago

ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving

Paper • 2309.17452 • Published Sep 29, 2023 • 3

zubingou

updated 5 models over 1 year ago

llm-agents/tora-70b-v1.0

Text Generation • Updated Oct 8, 2023 • 706 • 21

llm-agents/tora-13b-v1.0

Text Generation • Updated Oct 8, 2023 • 718 • 6

llm-agents/tora-code-7b-v1.0

Text Generation • Updated Oct 8, 2023 • 749 • 18

llm-agents/tora-code-13b-v1.0

Text Generation • Updated Oct 8, 2023 • 762 • 15

llm-agents/tora-code-34b-v1.0

Text Generation • Updated Oct 8, 2023 • 721 • 14

LLM-Agents

AI & ML interests

llm-agents's activity

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations

DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

Team members 3
