AgentEvol-7B
π Paper β’ π Project Page β’ π» [Github Repo] β’ π [Trajectory Dataset] β’ π [Eval Benchmark] β’ π€ Model (AgentEvol-7B)
AgentEvol is a novel method to evolve generall-capable LLM-based agents across multiple environments. AgentEvol first trains a base generally-capable agent with behavioral cloning, equipping it with basic abability and prior knowledgs. Subsequently, the agent is allowed to perform exploration and learning acorss various tasks and environments.
AgentEvol-7B is trained with the AgentEvol algorithm on Llama-2-Chat-7B. The model is first trained on the AgentTraj set with behavioural cloning. Next it performs exploration and learning from a broader set of instructions. After evolution, its performance outperforms SOTA models on many tasks.
π Citation
@misc{xi2024agentgym,
title={AgentGym: Evolving Large Language Model-based Agents across Diverse Environments},
author={Zhiheng Xi and Yiwen Ding and Wenxiang Chen and Boyang Hong and Honglin Guo and Junzhe Wang and Dingwen Yang and Chenyang Liao and Xin Guo and Wei He and Songyang Gao and Lu Chen and Rui Zheng and Yicheng Zou and Tao Gui and Qi Zhang and Xipeng Qiu and Xuanjing Huang and Zuxuan Wu and Yu-Gang Jiang},
year={2024},
eprint={2406.04151},
archivePrefix={arXiv},
primaryClass={cs.AI}
}
- Downloads last month
- 541