gammatau (GammaTau AI)

noahshinn

authored a paper 8 months ago

$τ$-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

Paper • 2406.12045 • Published Jun 17, 2024 • 6

cassanof

updated a model 9 months ago

gammatau/verifier-6.7b-v1

Updated May 13, 2024

cassanof

authored a paper 11 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 137

noahshinn

authored a paper 12 months ago

Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions

Paper • 2312.12450 • Published Dec 11, 2023 • 1

cassanof

authored 5 papers 12 months ago

MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation

Paper • 2208.08227 • Published Aug 17, 2022 • 1

Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs

Paper • 2308.09895 • Published Aug 19, 2023 • 1

Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions

Paper • 2312.12450 • Published Dec 11, 2023 • 1

Type Prediction With Program Decomposition and Fill-in-the-Type Training

Paper • 2305.17145 • Published May 25, 2023

Reflexion: Language Agents with Verbal Reinforcement Learning

Paper • 2303.11366 • Published Mar 20, 2023 • 4

yuchiz

authored a paper about 1 year ago

Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks

Paper • 2401.17263 • Published Jan 30, 2024 • 1

cassanof

updated a model about 1 year ago

gammatau/deepseek-1b-multipl-t-rkt

Text Generation • Updated Nov 17, 2023 • 23

noahshinn

authored 2 papers about 1 year ago

Type Prediction With Program Decomposition and Fill-in-the-Type Training

Paper • 2305.17145 • Published May 25, 2023

Reflexion: Language Agents with Verbal Reinforcement Learning

Paper • 2303.11366 • Published Mar 20, 2023 • 4

yuchiz

authored a paper over 1 year ago

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

Paper • 2310.04406 • Published Oct 6, 2023 • 8

cassanof

updated 2 models over 1 year ago

gammatau/starcoder-1b-fit

Text Generation • Updated Aug 5, 2023 • 11

gammatau/santacoder-ts-fim

Text Generation • Updated Jun 19, 2023 • 3

GammaTau AI

AI & ML interests

gammatau's activity

$τ$-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

gammatau/verifier-6.7b-v1

StarCoder 2 and The Stack v2: The Next Generation

Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions

MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code Generation

Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs

Can It Edit? Evaluating the Ability of Large Language Models to Follow Code Editing Instructions

Type Prediction With Program Decomposition and Fill-in-the-Type Training

Reflexion: Language Agents with Verbal Reinforcement Learning

Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks

gammatau/deepseek-1b-multipl-t-rkt

Type Prediction With Program Decomposition and Fill-in-the-Type Training

Reflexion: Language Agents with Verbal Reinforcement Learning

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

gammatau/starcoder-1b-fit

gammatau/santacoder-ts-fim

AI & ML interests

Team members 5

gammatau's activity