NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models Paper • 2410.11805 • Published Oct 15 • 11 • 4
UI Agent Collection a collection of algorithmic agents for user interfaces/interactions and program synthesis • 202 items • Updated 2 days ago • 25
Mirror Collection Mirror: A Universal Framework for Various Information Extraction Tasks https://arxiv.org/abs/2311.05419 • 5 items • Updated Oct 11
ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM Paper • 2408.12076 • Published Aug 22 • 12