cindywen committed on
Commit
18fc70f
•
1 Parent(s): 91720ce

add: readme file

  1. README.md +13 -0
README.md ADDED
@@ -0,0 +1,13 @@
+ ## AgentEvol-7B
+
+ <p align="center">
+ 📃 <a href="TODO" target="_blank">Paper</a > • 🌐 <a href="https://agentgym.github.io/" target="_blank">Project Page</a > • 💻 <a href="https://github.com/WooooDyy/AgentGym" target="_blank">[Github Repo]</a> • 📚 <a href="https://huggingface.co/datasets/AgentGym/AgentTraj-L" target="_blank">[Trajectory Dataset]</a > • 📈 <a href="https://huggingface.co/datasets/AgentGym/AgentEval" target="_blank">[Eval Benchmark]</a> • 🤗 <a href="https://huggingface.co/AgentGym/AgentEvol-7B" target="_blank">Model (AgentEvol-7B)</a ><br>
+ </p >
+
+ **AgentEvol** is a novel method for evolving generally-capable LLM-based agents across multiple environments. AgentEvol first trains a base generally-capable agent with behavioral cloning, equipping it with basic abilities and prior knowledge. The agent then explores and learns across various tasks and environments.
+
+ **AgentEvol-7B** is trained with the AgentEvol algorithm on Llama-2-Chat-7B. The model is first trained on the AgentTraj set with behavioral cloning. Next, it explores and learns from a broader set of instructions. After evolution, it outperforms SOTA models on many tasks.
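
The two phases described in the README (behavioral cloning, then exploration and learning on a broader instruction set) can be illustrated with a toy sketch. This is not the authors' implementation: the tabular policy, the `ACTIONS` list, the binary `reward` function, the task names, and the `iterations`/`samples` parameters are all hypothetical, and the learning step is assumed here to be plain imitation of the agent's own successful attempts, which the README does not specify.

```python
import random
from collections import Counter, defaultdict

ACTIONS = ["left", "right", "up", "down"]  # hypothetical toy action space


def fit(dataset):
    """Behavioral cloning on a tabular policy: most frequent action per task."""
    counts = defaultdict(Counter)
    for task, action in dataset:
        counts[task][action] += 1
    return {task: c.most_common(1)[0][0] for task, c in counts.items()}


def evolve(policy, dataset, tasks, reward_fn, rng, iterations=5, samples=4):
    """Explore on a broader task set, keep rewarded attempts, then refit."""
    dataset = list(dataset)  # copy so the caller's expert data is untouched
    for _ in range(iterations):
        for task in tasks:
            for _ in range(samples):
                # Follow the current policy if it covers the task,
                # otherwise explore by sampling an action uniformly.
                action = policy[task] if task in policy else rng.choice(ACTIONS)
                if reward_fn(task, action) > 0:
                    dataset.append((task, action))
        policy = fit(dataset)  # learn from expert data plus own successes
    return policy


def success_rate(policy, tasks, reward_fn):
    """Fraction of tasks solved; tasks the policy does not cover count as failures."""
    solved = sum(reward_fn(t, policy[t]) for t in tasks if t in policy)
    return solved / len(tasks)


# Hypothetical environment: each task has one rewarded action.
answers = {f"task-{i}": ACTIONS[i % len(ACTIONS)] for i in range(6)}


def reward(task, action):
    return 1 if answers[task] == action else 0


rng = random.Random(0)
all_tasks = list(answers)

# Phase 1: behavioral cloning on a small expert set (2 of the 6 tasks).
expert = [("task-0", answers["task-0"]), ("task-1", answers["task-1"])]
base_policy = fit(expert)
bc_rate = success_rate(base_policy, all_tasks, reward)

# Phase 2: exploration and learning over the broader task set.
evolved_policy = evolve(base_policy, expert, all_tasks, reward, rng)
evolved_rate = success_rate(evolved_policy, all_tasks, reward)

print(f"after behavioral cloning: {bc_rate:.2f}")
print(f"after evolution:          {evolved_rate:.2f}")
```

Because only rewarded attempts are added to the dataset, the success rate in this toy setup cannot drop between iterations. A real LLM agent would replace the lookup table with model fine-tuning and the binary reward with environment feedback.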
+
+ ## 🔖 Citation
+
+ - TODO