MBZUAI/bactrian-x-llama-7b-lora

haonan-li committed on May 11, 2023

Commit c4c541b (1 parent: cbe5dbd)

update model

Files changed (3)

README.md +5 -3
adapter_config.json +2 -2
adapter_model.bin +1 -1

README.md CHANGED

@@ -2,6 +2,9 @@
 license: mit
 ---
 This repo contains a low-rank adapter (LoRA) for LLaMA-7b
 fit on the [Stanford-Alpaca-52k](https://github.com/tatsu-lab/stanford_alpaca)
 and [databricks-dolly-15k](https://github.com/databrickslabs/dolly/tree/master/data) data in 52 languages.
@@ -29,7 +32,6 @@ This version of the weights was trained with the following hyperparameters:
 - Lora _r_: 64
 - Lora target modules: q_proj, k_proj, v_proj, o_proj
-#### Current Training Steps: 21000
 That is:
@@ -40,7 +42,7 @@ python finetune.py \
     --batch_size=128 \
     --cutoff_len=512 \
     --group_by_length \
-    --output_dir='./bactrian-x-7b-lora' \
     --lora_target_modules='q_proj,k_proj,v_proj,o_proj' \
     --lora_r=64 \
     --micro_batch_size=32
@@ -57,7 +59,7 @@ Instructions for running it can be found at https://github.com/MBZUAI-nlp/Bactri
 ```
 @misc{bactrian,
-  author = {Haonan Li and Fajri Koto and Timothy Baldwin},
   title = {Bactrian-X: A Multilingual Replicable Instruction-Following Model},
   year = {2023},
   publisher = {GitHub},

 license: mit
 ---
+#### Current Training Steps: 40000
 This repo contains a low-rank adapter (LoRA) for LLaMA-7b
 fit on the [Stanford-Alpaca-52k](https://github.com/tatsu-lab/stanford_alpaca)
 and [databricks-dolly-15k](https://github.com/databrickslabs/dolly/tree/master/data) data in 52 languages.
 - Lora _r_: 64
 - Lora target modules: q_proj, k_proj, v_proj, o_proj
 That is:
     --batch_size=128 \
     --cutoff_len=512 \
     --group_by_length \
+    --output_dir='./bactrian-x-llama-7b-lora' \
     --lora_target_modules='q_proj,k_proj,v_proj,o_proj' \
     --lora_r=64 \
     --micro_batch_size=32
 ```
 @misc{bactrian,
+  author = {Haonan Li and Fajri Koto and Minghao Wu and Alham Fikri Aji and Timothy Baldwin},
   title = {Bactrian-X: A Multilingual Replicable Instruction-Following Model},
   year = {2023},
   publisher = {GitHub},
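
The `--batch_size=128` and `--micro_batch_size=32` flags in the command above suggest gradient accumulation. The sketch below assumes `finetune.py` derives the number of accumulation steps as `batch_size // micro_batch_size`, which is the convention in alpaca-lora style scripts; the diff does not show the script body, so treat this as an illustration rather than the script's actual logic. The helper name `accumulation_steps` is hypothetical.

```python
def accumulation_steps(batch_size: int, micro_batch_size: int) -> int:
    """Number of micro-batches accumulated per optimizer step.

    Assumption: the effective batch is built from equal-sized micro-batches.
    Rejecting non-divisible sizes is a choice made for this sketch only.
    """
    if micro_batch_size <= 0:
        raise ValueError("micro_batch_size must be positive")
    if batch_size % micro_batch_size != 0:
        raise ValueError("batch_size must be a multiple of micro_batch_size")
    return batch_size // micro_batch_size


# Values taken from the command shown in the README diff.
steps = accumulation_steps(batch_size=128, micro_batch_size=32)
print(steps)  # 4
```

Under that assumption, each optimizer update would see 4 micro-batches of 32 examples.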

adapter_config.json CHANGED

@@ -10,10 +10,10 @@
   "peft_type": "LORA",
   "r": 64,
   "target_modules": [
-    "[q_proj",
     "k_proj",
     "v_proj",
-    "o_proj]"
   ],
   "task_type": "CAUSAL_LM"
 }

   "peft_type": "LORA",
   "r": 64,
   "target_modules": [
+    "q_proj",
     "k_proj",
     "v_proj",
+    "o_proj"
   ],
   "task_type": "CAUSAL_LM"
 }
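
The hunk above removes stray brackets from two `target_modules` entries. As literal strings, `"[q_proj"` and `"o_proj]"` differ from the module names `q_proj` and `o_proj`, so they would presumably fail to match those layers when the adapter is loaded. A minimal sketch of a check that would have flagged the old file follows. The function name `malformed_target_modules` and the name pattern are hypothetical, and the JSON fragments contain only the fields visible in the diff, not the full config.

```python
import json
import re
from typing import List

# Assumption: module names consist of letters, digits, underscores, and dots.
VALID_NAME = re.compile(r"[A-Za-z_][A-Za-z0-9_.]*")


def malformed_target_modules(config_text: str) -> List[str]:
    """Return target_modules entries that do not look like plain module names."""
    config = json.loads(config_text)
    return [
        name
        for name in config.get("target_modules", [])
        if not VALID_NAME.fullmatch(name)
    ]


# Fragments rebuilt from the fields shown in the diff (not the complete file).
old_fragment = """
{"peft_type": "LORA", "r": 64,
 "target_modules": ["[q_proj", "k_proj", "v_proj", "o_proj]"],
 "task_type": "CAUSAL_LM"}
"""
new_fragment = """
{"peft_type": "LORA", "r": 64,
 "target_modules": ["q_proj", "k_proj", "v_proj", "o_proj"],
 "task_type": "CAUSAL_LM"}
"""

print(malformed_target_modules(old_fragment))  # ['[q_proj', 'o_proj]']
print(malformed_target_modules(new_fragment))  # []
```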

adapter_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a79d74d6cfed583c0a176438158a983133a2016ed923d25347e4335d95c7aab8
 size 268527949

 version https://git-lfs.github.com/spec/v1
+oid sha256:dabb634e58bfa65d99b894b6f5a390af86b12bf57246518b9a4eddaffdf471fc
 size 268527949
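
The two Git LFS pointers above differ only in `oid`; `size` is unchanged. That is consistent with retrained weights of the same shape, since `r` and the target modules did not change. The sketch below parses both pointers and compares the reported size with a rough estimate of the LoRA payload. The estimate assumes LLaMA-7B dimensions (hidden size 4096, 32 layers), square attention projections, and float32 storage; the storage dtype in particular is an assumption, not something the diff states. The helper `parse_lfs_pointer` is hypothetical.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


old = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:a79d74d6cfed583c0a176438158a983133a2016ed923d25347e4335d95c7aab8
size 268527949
""")
new = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:dabb634e58bfa65d99b894b6f5a390af86b12bf57246518b9a4eddaffdf471fc
size 268527949
""")

print(old["oid"] != new["oid"])    # True: the file content changed
print(old["size"] == new["size"])  # True: the byte count did not

HIDDEN_SIZE = 4096       # LLaMA-7B hidden dimension
NUM_LAYERS = 32          # LLaMA-7B transformer layers
LORA_R = 64              # from adapter_config.json
NUM_TARGET_MODULES = 4   # q_proj, k_proj, v_proj, o_proj
BYTES_PER_PARAM = 4      # assumption: float32

# Each adapted module holds A (r x d) and B (d x r).
params_per_module = 2 * LORA_R * HIDDEN_SIZE
lora_bytes = params_per_module * NUM_TARGET_MODULES * NUM_LAYERS * BYTES_PER_PARAM

print(lora_bytes)                     # 268435456
print(int(new["size"]) - lora_bytes)  # 92493
```

Under these assumptions the estimate accounts for all but about 90 KB of the file, which would plausibly be serialization overhead.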