Update README.md
Browse files
README.md
CHANGED
@@ -2,6 +2,7 @@
|
|
2 |
license: apache-2.0
|
3 |
datasets:
|
4 |
- krasserm/gba-trajectories
|
|
|
5 |
---
|
6 |
A planner LLM [fine-tuned on synthetic trajectories](https://krasserm.github.io/2024/05/31/planner-fine-tuning/) from an agent simulation. It can be used in [ReAct](https://arxiv.org/abs/2210.03629)-style LLM agents where [planning is separated from function calling](https://krasserm.github.io/2024/03/06/modular-agent/). Trajectory generation and planner fine-tuning are described in the [bot-with-plan](https://github.com/krasserm/bot-with-plan) project.
|
7 |
|
@@ -112,4 +113,4 @@ The planner learned a (static) set of available tools during fine-tuning. These
|
|
112 |
| `use_bash` | Useful for executing commands in a Linux bash. |
|
113 |
| `final_answer` | Useful for providing the final answer to a request. Must always be used in the last step. |
|
114 |
|
115 |
-
The framework provided by the [bot-with-plan](https://github.com/krasserm/bot-with-plan) project can easily be adjusted to a different set of tools for specialization to other application domains.
|
|
|
2 |
license: apache-2.0
|
3 |
datasets:
|
4 |
- krasserm/gba-trajectories
|
5 |
+
library_name: peft
|
6 |
---
|
7 |
A planner LLM [fine-tuned on synthetic trajectories](https://krasserm.github.io/2024/05/31/planner-fine-tuning/) from an agent simulation. It can be used in [ReAct](https://arxiv.org/abs/2210.03629)-style LLM agents where [planning is separated from function calling](https://krasserm.github.io/2024/03/06/modular-agent/). Trajectory generation and planner fine-tuning are described in the [bot-with-plan](https://github.com/krasserm/bot-with-plan) project.
|
8 |
|
|
|
113 |
| `use_bash` | Useful for executing commands in a Linux bash. |
|
114 |
| `final_answer` | Useful for providing the final answer to a request. Must always be used in the last step. |
|
115 |
|
116 |
+
The framework provided by the [bot-with-plan](https://github.com/krasserm/bot-with-plan) project can easily be adjusted to a different set of tools for specialization to other application domains.
|