Blackroot
/

Llama-3-8B-Abomination-LORA

Model card Files Files and versions Community

Blackroot commited on May 28

Commit

c99e988

•

1 Parent(s): 7e17501

Update README.md

Files changed (1) hide show

README.md +14 -0

README.md CHANGED Viewed

@@ -16,6 +16,20 @@ Merge LORA into instruct model -- 100 MB of structured story-instruct data:
 Trained using <https://github.com/unslothai/unsloth>
 Rough script:
 ```python
 trainer = SFTTrainer(
     model = model,
     train_dataset = train_dataset,

 Trained using <https://github.com/unslothai/unsloth>
 Rough script:
 ```python
+model = FastLanguageModel.get_peft_model(
+    model,
+    r = 64,
+    target_modules = ["q_proj", "v_proj", "k_proj", "o_proj", "gate_proj", "up_proj", "down_proj"],
+    lora_alpha = 32,
+    lora_dropout = 0.05, # 0 for base pretraining
+    bias = "none",
+    use_gradient_checkpointing = "unsloth",
+    random_state = 3407,
+    max_seq_length = max_seq_length,
+    use_rslora = True,
+    loftq_config = None,
+)
 trainer = SFTTrainer(
     model = model,
     train_dataset = train_dataset,