Agents - a Yoai Collection

Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Yoai 's Collections

Agents

Agent-Cognition

Medical

Agents

updated Oct 29

A Zero-Shot Language Agent for Computer Control with Structured Reflection

Paper • 2310.08740 • Published Oct 12, 2023 • 14
AgentTuning: Enabling Generalized Agent Abilities for LLMs

Paper • 2310.12823 • Published Oct 19, 2023 • 35
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

Paper • 2308.10848 • Published Aug 21, 2023 • 1
CLEX: Continuous Length Extrapolation for Large Language Models

Paper • 2310.16450 • Published Oct 25, 2023 • 9
An Early Evaluation of GPT-4V(ision)

Paper • 2310.16534 • Published Oct 25, 2023 • 21
Personas as a Way to Model Truthfulness in Language Models

Paper • 2310.18168 • Published Oct 27, 2023 • 5
ChatCoder: Chat-based Refine Requirement Improves LLMs' Code Generation

Paper • 2311.00272 • Published Nov 1, 2023 • 9
Tell Your Model Where to Attend: Post-hoc Attention Steering for LLMs

Paper • 2311.02262 • Published Nov 3, 2023 • 10
Ultra-Long Sequence Distributed Transformer

Paper • 2311.02382 • Published Nov 4, 2023 • 2
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning

Paper • 2311.02303 • Published Nov 4, 2023 • 4
Prompt Engineering a Prompt Engineer

Paper • 2311.05661 • Published Nov 9, 2023 • 20
Beyond ChatBots: ExploreLLM for Structured Thoughts and Personalized Model Responses

Paper • 2312.00763 • Published Dec 1, 2023 • 19
Merlin:Empowering Multimodal LLMs with Foresight Minds

Paper • 2312.00589 • Published Nov 30, 2023 • 24
Instruction-tuning Aligns LLMs to the Human Brain

Paper • 2312.00575 • Published Dec 1, 2023 • 11
DeepCache: Accelerating Diffusion Models for Free

Paper • 2312.00858 • Published Dec 1, 2023 • 21
PathFinder: Guided Search over Multi-Step Reasoning Paths

Paper • 2312.05180 • Published Dec 8, 2023 • 9
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent

Paper • 2312.10003 • Published Dec 15, 2023 • 37
Faithful Persona-based Conversational Dataset Generation with Large Language Models

Paper • 2312.10007 • Published Dec 15, 2023 • 6
Supervised Knowledge Makes Large Language Models Better In-context Learners

Paper • 2312.15918 • Published Dec 26, 2023 • 8
Infinite-LLM: Efficient LLM Service for Long Context with DistAttention and Distributed KVCache

Paper • 2401.02669 • Published Jan 5 • 14
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk

Paper • 2401.05033 • Published Jan 10 • 16
LLaVA-φ: Efficient Multi-Modal Assistant with Small Language Model

Paper • 2401.02330 • Published Jan 4 • 14
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

Paper • 2401.16158 • Published Jan 29 • 19
Weaver: Foundation Models for Creative Writing

Paper • 2401.17268 • Published Jan 30 • 43
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains

Paper • 2402.05140 • Published Feb 6 • 20
More Agents Is All You Need

Paper • 2402.05120 • Published Feb 3 • 51
Premise Order Matters in Reasoning with Large Language Models

Paper • 2402.08939 • Published Feb 14 • 27
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts

Paper • 2402.09727 • Published Feb 15 • 36
Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15 • 104
In Search of Needles in a 10M Haystack: Recurrent Memory Finds What LLMs Miss

Paper • 2402.10790 • Published Feb 16 • 41
Evaluating Very Long-Term Conversational Memory of LLM Agents

Paper • 2402.17753 • Published Feb 27 • 18
GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 185
Learning to Decode Collaboratively with Multiple Language Models

Paper • 2403.03870 • Published Mar 6 • 18
SOTOPIA-π: Interactive Learning of Socially Intelligent Language Agents

Paper • 2403.08715 • Published Mar 13 • 20
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Paper • 2403.12881 • Published Mar 19 • 16
Evolutionary Optimization of Model Merging Recipes

Paper • 2403.13187 • Published Mar 19 • 50
LLM Agent Operating System

Paper • 2403.16971 • Published Mar 25 • 65
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System

Paper • 2402.15538 • Published Feb 23 • 6
LLMs Simulate Big Five Personality Traits: Further Evidence

Paper • 2402.01765 • Published Jan 31
LLM Agents in Interaction: Measuring Personality Consistency and Linguistic Alignment in Interacting Populations of Large Language Models

Paper • 2402.02896 • Published Feb 5
Is Cognition and Action Consistent or Not: Investigating Large Language Model's Personality

Paper • 2402.14679 • Published Feb 22
PHAnToM: Personality Has An Effect on Theory-of-Mind Reasoning in Large Language Models

Paper • 2403.02246 • Published Mar 4 • 1
LLM Multi-Agent Systems: Challenges and Open Problems

Paper • 2402.03578 • Published Feb 5
AgentScope: A Flexible yet Robust Multi-Agent Platform

Paper • 2402.14034 • Published Feb 21 • 12
Social Skill Training with Large Language Models

Paper • 2404.04204 • Published Apr 5 • 15
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29 • 68
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2 • 119
Mixture-of-Agents Enhances Large Language Model Capabilities

Paper • 2406.04692 • Published Jun 7 • 55
Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26 • 47
ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning

Paper • 2406.19741 • Published Jun 28 • 59
LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29 • 37
Agentless: Demystifying LLM-based Software Engineering Agents

Paper • 2407.01489 • Published Jul 1 • 42
AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents

Paper • 2407.04363 • Published Jul 5 • 27
Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge

Paper • 2407.03958 • Published Jul 4 • 18
LAMBDA: A Large Model Based Data Agent

Paper • 2407.17535 • Published Jul 24 • 34
PERSONA: A Reproducible Testbed for Pluralistic Alignment

Paper • 2407.17387 • Published Jul 24 • 18
GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS

Paper • 2408.01584 • Published Aug 2 • 7
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Paper • 2408.03615 • Published Aug 7 • 30
Generating novel experimental hypotheses from language models: A case study on cross-dative generalization

Paper • 2408.05086 • Published Aug 9 • 4
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12 • 117
Benchmarking Agentic Workflow Generation

Paper • 2410.07869 • Published Oct 10 • 25
marcelbinz/Llama-3.1-Centaur-70B

Text Generation • Updated Nov 2 • 893 • 14

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs