add: readme file
Browse files
README.md
ADDED
@@ -0,0 +1,13 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
## AgentEvol-7B
|
2 |
+
|
3 |
+
<p align="center">
|
4 |
+
π <a href="TODO" target="_blank">Paper</a > β’ π <a href="https://agentgym.github.io/" target="_blank">Project Page</a > β’ π» <a href="https://github.com/WooooDyy/AgentGym" target="_blank">[Github Repo]</a> β’ π <a href="https://huggingface.co/datasets/AgentGym/AgentTraj-L" target="_blank">[Trajectory Dataset]</a > β’ π <a href="https://huggingface.co/datasets/AgentGym/AgentEval" target="_blank">[Eval Benchmark]</a> β’ π€ <a href="https://huggingface.co/AgentGym/AgentEvol-7B" target="_blank">Model (AgentEvol-7B)</a ><br>
|
5 |
+
</p >
|
6 |
+
|
7 |
+
**AgentEvol** is a novel method to evolve generall-capable LLM-based agents across multiple environments. AgentEvol first trains a base generally-capable agent with behavioral cloning, equipping it with basic abability and prior knowledgs. Subsequently, the agent is allowed to perform exploration and learning acorss various tasks and environments.
|
8 |
+
|
9 |
+
**AgentEvol-7B** is trained with the AgentEvol algorithm on Llama-2-Chat-7B. The model is first trained on the AgentTraj set with behavioural cloning. Next it performs exploration and learning from a broader set of instructions. After evolution, its performance outperforms SOTA models on many tasks.
|
10 |
+
|
11 |
+
## π Citation
|
12 |
+
|
13 |
+
- TODO
|