lambdax commited on
Commit
c86b317
1 Parent(s): 1444143

init readme & config

Browse files
Files changed (2) hide show
  1. README.md +31 -0
  2. config.json +1 -1
README.md ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - THUDM/AgentInstruct
4
+ ---
5
+
6
+ ## AgentLM-13B
7
+
8
+ <p align="center">
9
+ 🤗 <a href="https://huggingface.co/datasets/THUDM/AgentInstruct" target="_blank">[Dataset] </a> • 💻 <a href="https://github.com/THUDM/AgentTuning" target="_blank">[Github Repo]</a> • 📌 <a href="https://THUDM.github.io/AgentTuning/" target="_blank">[Project Page]</a> • 📃 <a href="https://arxiv.org/abs/TODO" target="_blank">[Paper]</a>
10
+ </p>
11
+
12
+ **AgentTuning** represents the very first attempt to instruction-tune LLMs using interaction trajectories across multiple agent tasks. Evaluation results indicate that AgentTuning enables the agent capabilities of LLMs with robust generalization on unseen agent tasks while remaining good on general language abilities. We have open-sourced the AgentInstruct dataset and AgentLM.
13
+
14
+ ## Models
15
+
16
+ **AgentLM** models are produced by mixed training on AgentInstruct dataset and ShareGPT dataset from Llama-2-chat models.
17
+
18
+ The models follow the conversation format of [Llama-2-chat](https://huggingface.co/blog/llama2#how-to-prompt-llama-2), with system prompt fixed as
19
+
20
+ ```
21
+ You are a helpful, respectful and honest assistant.
22
+ ```
23
+
24
+ 7B, 13B, and 70B models are available on Huggingface model hub.
25
+
26
+ |Model|Huggingface Repo|
27
+ |---|---|
28
+ |AgentLM-7B| [🤗Huggingface Repo](https://huggingface.co/THUDM/agentlm-7b) |
29
+ |AgentLM-13B| [🤗Huggingface Repo](https://huggingface.co/THUDM/agentlm-13b) |
30
+ |AgentLM-70B| [🤗Huggingface Repo](https://huggingface.co/THUDM/agentlm-70b) |
31
+
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "THUDM/AgentLlama-13b",
3
  "architectures": [
4
  "LlamaForCausalLM"
5
  ],
 
1
  {
2
+ "_name_or_path": "THUDM/agentlm-13b",
3
  "architectures": [
4
  "LlamaForCausalLM"
5
  ],